SEQUENCE LISTING 



a) GpfSPAL INFOPi^ATION: 

\) APPLICANT: 

(A) NAME: sanofi 

(3) STREET: 32-34 rue Marbeuf 

(C) CITY: PARIS 

(E) COUNTRY: FRANCE 

[F) POSTAL CODE (ZIP): 75008 
;) TELEPHONE: 01 53 77 40 00 

{^) TELEFAX: 01 53 77 41 33 

(ii) TITLiS, OF INVENTION: SR-p70 

(ill J NUMBERNoF SEQUENCES: 4 0 

(iv) COMPUTERNREADABLE FORM: 

(A) MEDIOM TYPE: Floppy disk 
(3) COMPU^R: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARa^ PatentIn Release #1.0, Version #1.25 (EPO) 

(2) INFORMATION FOR SEQ \d NO: 1: 

(i) SEQUENCE CHARACTEklSTICS ; 

(A) LENGTH: 2874 \ase pairs 

(B) TYPE: nucleic \cid 

(C) STRANDEDNESS: dVible 

( D ) TOPOLOGY : 1 inear^ 

(ii) MOLECULE TYPE:^DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cebus apellc^ 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 156. .2066 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1^ 

TGCCTCCCCG CCCGCGCACC CGCCCCGAGG CCTGTGCTCC tVtGAAGGGG ACGCAGCGAA 

GCCGGGGCCC GCGCCAGGCC GGCCGGGACG GACGCCGATG CCCfeGAGCTG CGACGGCTGC 

AGAGCGAGCT GCCCTCGGAG GCCGGTGTGA GGAAG ATG GCC TCC ACC ACC 

Met AXa GlX Ser Thr Thr 
1 \ 5 

ACC TCC CCC GAT GGG GGC ACC ACG TTT GAG CAC CTC TGG \gC TCT CTG 
Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu His Leu Trp S>r Ser Leu 
10 15 ^ 

GAA CCA GAC AGC ACC TAG TTC GAC CTT CCC CAG TCA AGC CGg'^^ AAT 
Glu Pro Asp Ser Thr Tyr Phe Asp Leu Pro Gin Ser Ser Arg diy Asn 
25 30 35 ^ 

AAT GAG GTG GTG GGT GGC ACG GAT TCC AGC ATG GAC GTC TTC CAc\tA 
Asn Glu Val Val Gly Gly Thr Asp Ser Ser Met Asp Vai Phe His l\u 
40 45 50 ^ 

GAG GGC ATG ACC ACA TCT GTC ATG GCC CAG TTC AAT TTG CTG AGC AGC 
Glu Gly Met Thr Thr Ser Val Met Ala Gin Phe Asn Leu Leu Ser Ser 

60 65 70 

ACC ATG GAC CAG ATG AGC AGC CGC GCT GCC TCG GCC AGC CCG TAC ACC 
Thr Met Asp Gin Met Ser Ser Arg Ala Ala Ser Ala Ser Pro Tyr T^r 
^5 .80 85 



60 
120 
173 

221 

269 

317 

365 



51 



CCG GAG CAC GCC GCC AGC GTG CCC- ACC CAT TCA CCC TAG GCA CAG CCC 
Pro Glu His Ala Ala Ser Val Pro Thr His Ser Pro Tyr Ala Gin Pro 
90 95 100 



461 



.-^^.^ lyC ACC TTC GAC ACC ATG TCG CCC GCG CCT GTC ATC CCC TCC AAC- 
Ser SeV Thr Phe Asp- Thr Met Ser Pro Ala Pro Val lie Pro Ser Asn- 
.105 110 lis 

ACC GAC tVt CCC GGA CCC CAC CAC TTC GAG GTC ACT TTC CAG CAG TCC 
Thr Asp Ty\ Pro Gly Pro His His Phe Glu Val Thr Phe Gin Gin Ser 
120 \ 125 ■ 130 

AGC ACG GCC A^ TCA GCC ACC TGG ACG TAG TCC CCA CTC TTG AAG AAA 
Ser Thr Ala.Ly\ Ser Ala Thr Trp Thr Tyr Ser Pro Leu Leu Lys Lys 
i35 \ 140 145' 150 

CTC TAC TGC CAG A^ GCC AAG ACA TGC CCC ATC CAG ATC AAG GTG TCC 
Leu Tyr Cys Gin Ilk Ala Lys Thr Cys Pro lie Gin He Lys Val Ser 
.1S5\ 160 165 

GCC CCA CCG CCC CCG GOC- ACC GCC ATC CGG GCC ATG CCT GTC TAC AAG 
Ala Pro Pro Pro Pro Gly^Thr Ala He Arg Ala Met Pro Val Tyr Lys 
170 \\ 175 180 

AAG GCG GAG CAC GTG ACC GA>S ATC GTG AAG CGC TGC CCC AAC CAC GAG 
Lys Ala Glu His Val Thr AspXile Val Lys Arg- Cys Pro Asn His Glu 
185 X?0 195 

CTC GGG AGG GAC TTC AAC GAA Gg\ CAG TCT GCC CCA GCC AGC CAC CTC 
Leu Gly Arg Asp Phe Asn Glu GlyXcin Ser Ala Pro Ala Ser His Leu 
200 205 \ 210 

ATC CGT GTG GAA GGC AAT AAT CTC TcdvCAG TAT GTG GAC GAC CCT GTC 
lie Arg Val Glu Gly Asn Asn Leu Ser\in Tyr Val Asp Asp Pro Val 
215 220 \ 225 \ 230 

ACC GGC AGG CAG AGC GTC GTG GTG CCC TAtXgAG CCA CCA CAG GTG GGG ' 
Thr Gly Arg Gin Ser Val Val Val Pro Tyr \lu Pro Pro Gin Val Gly 
235 240 \ 245 



ACA GAA TTC ACC ACC ATC CTG TAC AAC TTC ATG^ 
Thr Glu Phe Thr Thr lie Leu Tyr Asn Phe Met 
250 255 



rGT AAC AGC AGC TGT 
Asn Ser Ser Cys 
260 



TG GGG GGC ATG AAC CGA CGG CCC ATC CTC ATC ATC \tC ACC CTG GAG 
Val Gly Gly Met Asn Arg Arg Pro lie Leu He He Thr Leu Glu 

265 270 27i 

ACG CGG GAT GGG CAG GTG CTG GGC CGC CGG TCC TTC GAG G^SiC CGC ATC 
Thr Arg Asp Gly Gin Val Leu Gly Arg Arg Ser Phe Glu GlV Arg He 
280 285 290 

TGC GCC TGT CCT GGC CGC GAC CGA AAA GCC GAT GAG GAC CAC TA^ 
Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu Asp His Tyr TLrg 
295 300 305 

GAG CAG CAG GCC TTG AAT GAG AGC TCC GCC AAG AAc' GGG GCT GCC AGC^ 
Glu Gin Gin Ala Leu Asn Glu Ser Ser Ala Lys Asn Gly Ala Ala Ser 
315 320 325 

AAG CGC GCC TTC AAG CAG AGT CCC CCT GCC GTC CCC GCC CTG GGC CCG 
Lys Arg Ala Phe Lys Gin Ser Pro Pro Ala Val Pro Ala Leu Gly Pro 
330 335 340 

GGT GTG AAG AAG CGG CGG CAC GGA GAC GAG GAC ACG TAC TAC CTG CAG 
Gly Val Lys Lys Arg Arg His Gly Asp Glu Asp Thr Tyr Tyr Leu Gin 
345 350 355 

GTG CGA GGC CGC GAG AAC TTC GAG ATC CTG ATG AAG CTG AAG GAG AGC 
Val Arg Gly Arg Glu Asn Phe Glu He Leu Met Lys Leu Lys Glu Ser 
360 365 370 



509 



557 



605 



653 



701 



749 



797 



345. 



993 



9.41 



989 



1037 



1085 



1133 



1229 



1277 



52 



CTG GAG CTG ATG GAG TTG GTG CCG CAG CCG CTG GTA GAC TCC TAT CGG 
Leu Glu Leu Met Glu Leu Val Pro Gin Pro Leu Vai Asp Ser Tyr A^a 
3*^3^ 380 385 3^0 

CAG CA^AG CAG CTC CTA CAG AGG CCG AGT CAC CTA CAG CCC CCA TCC 
Gin Gin (Sin Gin Leu Leu Gin Arg Pro Ser His Leu Gin Pro Pro Ser 
^ 395 400 405 

TAC GGG CCG>^TC CTC TCG CCC ATG AAC AAG GTG CAC GGG GGC GTG AAC 
Tyr Gly Pro v\l Leu Ser Pro Met Asn Lys Val His Gly Gly Vai Asn 

415 420 

AAG CTG CCC TCC )s^C AAC CAG CTG GTG GGC CAG CCT CCC CCG CAC AGC 
.Lys Leu Pro Ser vVl Asn Gin Leu Val Gly Gin Pro Pro Pro His Ser 
425 \ 430 435 

TCG GCA GCT ACA CCC \aC CTG GGA CCT GTG GGC TCT GGG ATG CTC AAC 
Ser Ala Ala Thr Pro A\n Leu Gly Pro Val Gly Ser Gly Met Leu Asn 
440 \ 445 450 

AAC CAC GGC CAC GCA GTG \cA GCC AAC AGC GAG ATG ACC AGC AGC CAC 
Asn His Gly His Ala Val Ala Asn Ser Glu Met Thr Ser Ser His 

455 460 \ 465 470 



GGC ACC CAG TCC ATG GTC TCG 
Gly Thr Gin Ser Met Val Ser 
475 



TCC CAC TGC ACT CCG CCA CCC CCC 
GlV Ser His Cys Thr Pro Pro Pro Pro 
480 485 




TAC CAC GCC GAC CCC AGC CTC GTC A\T TTT TTA. ACA GGA TTG GGG TGT 
Tyr His Ala' Asp Pro Ser Leu Val Se\ Phe Leu Thr Gly Leu Gly Cys 
490 495\ 500 

CCA AAC TGC ATC GAG TAT TTC ACG TCC CAG GGG TTA CAG AGC ATT TAC 
Pro Asn Cys lie Glu Tyr Phe Thr Ser Gl\Gly Leu Gin Ser He Tyr 
505 510 \ - 515 

CAC CTG CAG AAC CTG ACC ATC GAG GAC CTG GGG GCC CTG AAG ATC CCC 
His Leu Gin Asn Leu Thr He Glu Asp Leu GlAAla Leu Lys He Pro 
520 525 >^30 

GAG CAG TAT CGC ATG ACC ATC TGG CGG GGC CTG CXfe. GAC CTG AAG CAG 
Glu Gin Tyr Arg Met Thr lie Trp Arg Gly Leu GlrAAsp Leu Lys Gin 
535 540 545 \ 550 

GGC CAC GAC TAC GGC GCC GCC GCG CAG CAG CTG CTC CG\ TCC AGC AAC 
Gly His Asp Tyr Gly Ala Ala Ala Gin Gin Leu Leu ArgXser Ser Asn 
555 560 \ 565 

GCG GCC GCC ATT TCG ATC GGC GGC TCC GGG GAG CTG' CAG CG^V CAG CGG 
Ala Ala Ala He Ser He Gly Gly Ser Gly Glu Leu Gin ArgNcin Arg 
570 575 580 ^ 

GTC ATG GAG GCC GTG CAC TTC CGC GTG CGC CAC ACC ATC ACC At\cCC 
Val Met Glu, Ala Val His Phe Arg Val Arg His Thr He Thr He\ro 
585 590 595 

AAC CGC GGC GGC CCC GGC GCC GGC CCC GAC GAG TGG GCG GAC TTC ^> 
Asn Arg Gly Gly Pro Gly Ala Gly Pro Asp Glu Trp Ala Asp Phe Gly 
600 605 610 

TTC GAC CTG CCC GAC TGC AAG GCC CGC AAG CAG CCC ATC AAG GAG GAG 
Phe Asp Leu Pro Asp Cys Lys Ala Arg Lys Gin Pro He Lys Glu Glu 

620 625 630 

TTC ACG GAG GCC GAG ATC CAC TGAGGGGCCG GGCCCAGCCA GAGCCTGTGC 
Phe Thr Glu Ala Glu He His 
635 

CACCGCCCAG AGACCCAGGC CGCCTCGCTC TCCTTCCTGT GTCCAAAACT GCCTCCGGAG 
GCAGGGCCTC CAGGCTGTGC CCGGGGAAAG GCAAGGTCCG GCCCATGCCC CGGCACCTCA 



13-2 5 



1373 



1421 



1469 



1517 



1565 



1613 



1661 



1709 



1757 



1305 



1353 



1901 



1949 



1997 



2045 



2156 
2216 



53 



CCGGCCCCAG 
AACCTTCTGG 
SJGCCACTGC 
CCiCCAGGCC 
AAAGAi^TCCT 
AGAGTGG^A 
TAACGGGCH 
ATTCTGCGGG 
GAAAACTGCG 
AGTGCCCCTC 
GCCTGTCCTT 



GAGAGGCCCA 
AGCTGCCCTA 
CGGGCGTGCT 
TCATCCTAGA 
CTTCGCTGGT 
GCAGCCAAGC 
GGGACTCTCT 
CGCCTCCT 
GAftGGCGCAA 
AGCCt^GCCA 
AGAGGAdCGG 



GCCACCAAAG 

ATGCTGGGCT 

CCATGGCAGG 

GACTCTGTCA 

GGACTGCCAA 

GACTGTGTCT 

CTGCTGGACT 

TCCTGCCCCT 

CCCCTCCCAG 

CAGTCACCTC 

AAATTGTCAA 



CCGCCTGCGG 

TGCGGGGCAG 

CGTGGGTGGG 

TCTGCCGATC 

AAAGTATTTT 

GAAACACCGT 

TGGGAGTGGC 

AACAACCACC 

GTGCGGGAAG 

TCCTTGGGGA 

TATTTGATAA 



ACAGCCTGAG 

GGGCCGGCCC 

GACCGCAGTG 

AAGCAAGGTC 

GCGACATCTT 

GCATTT7CAG 

CTTTGCCCCC 

AAAGTGTTGC 

CATCTGGTAC 

ACCCTGGGCA 

AATGATACCC 



TCACCTGCAG 

ACTCTCAGCC 

TCAGCTCCGA 

CTTCCAGAGG 

TTGGTTCTGG 

GGAATGTCCC 

AGCACACTGT 

TGAAATTGGA 

CGCCTCGGCC 

GAAAGGGACA 

TTTTCTAC 



(2) IMFORMATION FOR S^Q ID NO: 2: 

(i) -SEQUENCE CHARACTERISTICS: 
(A) LENGTH: SsJ amino acids 
-(B) TYPE: amin\acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(XI) SEQUENCE DESCRIPTIOnX SEQ TD NO: 2: 

Met Ala Gin Ser. Thr Thr Thr Ser\ro Asp Gly 
i 5 \ 10 

His Leu Trp Ser Ser Leu Glu Pro AspNser Thr 
20 25 

Gin Ser Ser Arg Gly. Aan Asn Glu Val ValNciy 
35 40 ^ 

Met Asp Val Phe His Leu Glu Gly Met Thr Thr^ 
50 55 

Phe Asn Leu Leu Ser Ser Thr Met Asp Gin Met 
S5 70 75 

Ser Ala Ser Pro Tyr Thr Pro Glu His Ala Ala 

85 90 

Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp 
100 105 

Pro Val lie Pro Ser Asn Thr Asp Tyr Pro Gly 
115 120 

Val Thr Phe Gin Gin Ser Ser Thr Ala Lys Ser 
130 135 



Gly Thr Thr Phe Glu 
15 

Tyr Phe Asp Leu Pro 
30 

Gly Thr Asp Ser Ser 
45 

3er Val Met Ala Gin 



Ser 5ser 
Ser Val 
Thr Met 



Pro His 
125 

Ala Thr 
140 



Arg Ala Ala 

80 

fo Thr His 
95 

Ser ^o Ala 
110 

His Phe 



Trp Thr Tyr 



Ser Pro Leu Leu Lys Lys Leu Tyr Cys Gin He 
1^5 150 



Thr Cys Pro 
160 



Ala Lys 
Gly Thr 
Thr Asp 

Arg Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gin S r 



lie Gin He Lys Val Ser Ala Pro Pro Pro Pro 
165 170 

Ala Met Pro Val Tyr Lys Lys Ala Glu His Val 

. 180 185 



Ala He Arg 
175 

He Val Lys 
190 



2276 

2336 

2396 

2456 

2516 

2576 

2636 

2696 

2756 

2316 . 

2874 



54 



195 



200 



205 




Ala Pro Ala Set His Leu He Arg Val Glu Gly Asn Asn Leu Ser Gin 

210 215 220 

TvSr Val Asp Asp Pro Val Thr Gly Arg Gin Ser Val Val Val Pro Tvr 
225\ 230 . - 235 240 

;iu Pro Gin Val Gly Thr Glu Phe Thr Thr He Leu Tvr Asn Phe 

^ 245 - 250 255 

Met Cys A>n Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro He Leu 
\260 265 270 

lie He He tXx Leu Glu Thr Arg Asp Gly Gin Val Leu Gly Ara Ara 
275 \ 280 , 285 

Ser Phe Glu Gly He Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala 

290 \ 295 300 

Asp Glu Asp His Tyr Xrg Glu Glh Gin Ala Leu Asn Glu Ser Ser Ala 
305 3!SD 315 320 

Lys Asn Gly Ala Ala Ser\y3 Arg Ala Phe Lys Gin Ser Pro Pro Ala 
325 \ 330 335 

Val Pro Ala Leu Gly Pro GlyVal Lys Lys Arg Arg His Gly Asp Glu 
340 \ 345 350 * 

Asp Thr Tyr Tyr Leu Gin Val Arg\ly Arg Glu Asn Phe Glu He Leu 
355 360 \ 365 

Met Lys Leu Lys Glu Ser Leu Glu Leu\tet Glu Leu Val Pro Gin Pro 
370 375 \ 380 

Leu Val Asp Ser Tyr Arg Gin Gin Gin GlnSLeu Leu Gin Arg Pro Ser 
395 390 Vp«; 400 

His Leu Gin Pro Pro Ser Tyr Gly Pro Val LeuNser Pro Met Asn Lys 
405 410 \ 415' 

Val His Gly Gly Val Asn Lys Leu Pro Ser Val AsnSsGln Leu Val Gly 
420 425 \ 430 

Gin Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu\Gly Pro Vai 
435 440 445 

Gly Ser Gly Met Leu Asn Asn His Gly His Ala Val Pro AlaVAsn Ser 
450 455 460 

Glu Met Thr Ser Ser His Gly Thr Gin Ser Met Val Ser Gly SeVnis 



470 



475 



180 



Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser 
485 490 495 

Leu Thr Gly Leu Gly Cys Pro Asn Cys He Glu Tyr Phe Thr Ser Gin 
500 505 510 

Gly Leu Gin Ser He Tyr His Leu Gin Asn Leu Thr He Glu Asp Leu 
515 520 525 

Gly Ala Leu Lys He Pro Glu Gin Tyr Arg Met Thr He Trp Arg Gly 
530 535 ^' ^ y 

Leu Gin Asp Leu Lys Gin Gly His Asp Tyr Gly Ala Ala Ala Gin Gin 

550 555 560 

Leu Leu Arg Ser Ser Asn Ala Ala Ala He Ser He Gly Gly Ser Gly 
565 570 575 

Glu Leu Gin Arg Gin Arg Val Met Glu Ala Vai His Phe Arg Val Arg 



55 



.580, 



535 



590 



His 



625 



Thr He Thr He Pro Ash Arg G.ly Gly Pro Gly Ala Gly Pro Asp 
595 600 605 

Trp Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lvs 
no 615 620 

He Lys Giu Glu Phe Thr. Glu Ala Glu He His 
630, 635 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEOTENCE CHARACTERISTICS: 

(A)V.ENGTH: 2034 base pairs 
(3) \i?E: nucleic acid 

(C) smWiDEDNESS : double 

(D) T05^LOGY: linear 

(ii) MOLECULE TOPE: <=DNA 



(vi) ORIGINAL SOORCE: 

(A) ORGANISM^ Cebus apella 



(ix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: ISS 



1652 




Cxi) SEQUENCE DESCRIPTIONS. SEQ ID NO: 3: 

TGCCTCCCCG CCCGCGCACC CGCCCCGAG\ CCTGTGCTCC TGCGAAGGGG ACGCAGCGAA 

GCCGGGGCCC GCGCCAGGCC GGCCGGGACG d>\CGCCGATG CCCGGAGCTG CGACGGCTGC 

AGAGCGAGCT GCCCTCGGAG GCCGGTGTGA GGAA(J ATG GCC CAG TCC ACC ACC 

SMec Ala Gin Ser Thr Thr 

* 5 

ACC TCC CCC GAT GGG GGC ACC ACG TTT .GAG Ci^ CTC TGG AGC TCT CTG 
Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu HisVeu Trp Ser Ser Leu 
10 15 \ . 20 

GAA CCA GAC AGC ACC TAG TTC GAC CTX CCC CAG TCaVgC CGG GGG AAT 
Glu Pro Asp Ser Thr Tyr. Phe Asp Leu Pro Gin Ser Ster Arg Gly Asn 

25 30 . 31" 

AAT GAG GTG GTG GGT GGC ACG GAT TCC AGC ATG GAC GTC Xtc CAC CTA 
Asn Glu Val Val Gly Gly Thr Asp Ser Ser Met Asp Val Ph^His Leu 

40 , 45 50 ^ 

GAG GGC ATG ACC ACA TCT GTC ATG GCC CAG TTC AAT TTG CTG AG\ AGC 
Glu Gly Met Thr Thr Ser Val Met Ala Gin Phe Asn Leu Leu Server 
55 60 65 >ao 

ACC ATG GAC CAG ATG AGC AGC CGC GCT GCC TCG GCC AGC CCG TAG ACC^ 
Thr Met Asp Gin Met Ser Ser Arg Ala Ala Ser Ala Ser Pro Tyr Thr 

75 80 . . 85- 

CCG GAG CAC GCC GCC AGC GTG CCC ACC CAT TCA CCC TAC GCA CAG CCC 
Pro Glu His Ala Ala Ser Val Pro Thr His Ser Pro Tyr Ala Gin Pro 
90 95 100 

AGC TCC ACC TTC GAC ACC ATG TCG CCC GCG CCT GTC ATC CCC TCC AAC " 
Ser Ser Thr Phe Asp Thr Met Ser Pro Ala Pro Val He Pro Ser Asn 
105 110 lis 

ACC GAC TAT CCC GGA CCC CAC CAC TTC GAG GTC ACT TTC CAG CAG TCC 
ihr Asp "Tyr Pro Gly Pro His His Phe Glu Val Thr Phe Gin Gin Ser 
120 125 130 

AGC ACG GCC AAG TCA GCC ACC TGG ACG TAC TCC CCA CTC TTG AAG AAA 



60 
120 
173 

221 

269 

317 

365 

413 



557 



605 



56 



Ser 

135 

CTC 
Leu 



GCC 



AAG 



CTC 
Leu 



ATC 
lie 
215 

ACC 
Thr 



Thr Ala Lys Ser 



TAC TGC 
Tyr Cys 



CCA CCG 
Pro Pro 



GCGS.GAG 
Ala Xlu 

1' 

GGG AGG 
Gly Arg 

200 

CGT GTG 
Arg Val 



CAG ATC 
Gin He 
155 

CCC CCG 
Pro Pro 
170 

CAC GTG 
His Val 



Ala Thr 
140 

GCC AAG 
Ala Lys 



GGC ACC 
Gly Thr 

ACC GAC 
Thr Asp 



lC TTC 
Ak3 Phe 



GAA 

Glu Glj 



ACA 
Thr 



GGC AGG 
Gly Arg 



GAA TTC 
Glu Phe 




GTG 
val 



ACG 
Thr 



TGC 
Cys 
295 

GAG 
Glu 



GGG GGC 
Gly Gly 
265 

CGG GAT 
Arg Asp 

280 

GCC TGT 
Ala Cys. 



CAG CAG 
Gin Gin 



CAG AGC 
Gin Ser 
235 

ACC ACC 
Thr Thr 
250 

ATG AAC 
Met Asn 



AAC GAA 
Asn Glu 

205 - 

AAT AAT 

^^Asn Asn 
220 

Gm^GTG 
Val Val 



Trp Thr 

ACA TGC 
Thr Cys 



GCC ATC 
Ala lie 
175 

ATC GTG 
He Val 
190 

GGA CAG 
Gly Gin 



Tyr Ser 
145 

CCC ATC 
Pro lie 
160 

CGG GCC 
Arg Ala 



Pro Leu Leu Lys l 



CAG ATC 
Gin lie 



ATG CCT 
Met Pro 



AAG CGC 
Lys Arg 



TCT GCC 
Ser Ala 



CTC TCG 
Leu Ser 



GTG CCC 
Val Pro 



ATC CTG 
lie Leu 



GGG CAG 
Gly Gin 



CCT GGC 
Pro Gly 



GCC 
Ala 



AAG 
Lys 

GGT 
Gly 



GTG 
Val 



CTG 
Leu 
375 

CAG 
Gin 



CGC GCC 
Arg Ala 



GTG 
Val 



CGA 
Arg 

360 

GAG 
Glu 



AAG 
Lys 
345 

GGC 
Gly 



TTC 
Phe 
330 

AAG 
Lys 



TTG 
Leu 
315 

AAG. 
Lys 



CGA CGG 
Arg Arg 

GTG CTG 
Val Leu 
285 

CGC GAC 
Arg Asp 

300 

AAT GAG 
Asn Glu 



TAC AAC 
ir Asn 
255 

CCC Arc 
Pro ir 
270 

GGC CGC 
Gly Arg 



CAG TAT 
Gin Tyr 
225 

.TAT GAG 
Tyr Glu 
240 

TTC ATG 
Phe Met 



CTC ATC 
Leu He 



TCC 
Ar<JvSer 



CGA AAA 
Arg Lys 



AGC TCC 
Ser Ser 



CAG AGT 
Gin Ser 



CGG 
Arg 



CGC GAG 
Arg Glu 



CTG 
Leu 



CAG CAG 
Gin Gin 



TAC 
Tyr 



GGG CCG 
Gly Pro 



ATG GAG 
Met Glu 



CAG CTC 
Gin Leu 
395 

GTC CTC 
Val Leu 
410 



CGG CAC 
Arg His 

AAC TTC 
Asn Phe 
365 

TTG GTG 
Leu Val 
380 

CTA CAG 
Leu Gin 



CCC CCT 
Pro Pro 
335 

GGA GAC 
Gly Asp 
350 

GAG ATC 
Glu lie 



GCC GA1 
Ala Asp 
305 

GCC AAG 
Ala Lys 

320 

GCC GTC 
Ala Val 



GAG GAC 
Glu Asp 



TGC CCC 
Cys Pro 
195 

CCA GCC 
Pro Ala 

210 . 

GTG GAC 
Val Asp 



CCA CCA 
Pro Pro 



TGT AAC 
Cys Asn 



ATC ATC 
He He 
275 

TTC GAG 
Phe Glu 
290 

GAG GAC 
riu Asp 



AAG 
Lys 



GTC 
Val 
180 

AAC 
Asn 



GTG 
Val 
165 

TAC 
Tyr 



CAC 
His 



ys 
150 

TCC 
Ser 



AAG 
Lys 



GAG 
Glu 



AGC CAC CTC 
Ser His Leu 



GAC 
Asp 



CAG 
Gin 



AGC 
Ser 

260 

ACC 
Thr 



CCT 
Pro 



GTG 
Val 
245 

AGC 
Ser 



GTC 
Val 
230 

GGG 
Gly 



TGT 
Cys 



CTG GAG 
Leu Glu 



GGC CGC ATC 
Gly Arg He 



CAC 
His 



AAC 
Asn 



GCT 
Ala 



CCC GCC 
Pro Ala 



ACG TAC 
Thr Tyr 
355 



'G 
Le\ 
340^ 

TAC 
Tyr 



TAC CGG 
Tyr Arg 

•310 

GCC AGC 
Ala Ser 
325 

GGC CCG 
Gly Pro 



CAG 
, Gin 



CTG ATG 
Leu Met 



CCG CAG 
Pro Gin 



AGG CCG 
Arg Pro 



TCG CCC 
Ser Pro 



ATG AAC 
Met Asn 
415 



CCG CTG 
Pro Leu 
385 

AGT CAC 
Ser His 

400 

AAG GTG 
Lys Val 



AAG CTG 
Lys Leu 

370 

GTA GAC 
Val Asp 



CTA CAG 
Leu Gin 



CAC GGG 
His Gly 



AAG GAG AC 
Lys Glu Se] 



TCC 
Ser 



CCC 
Pro 



GGC 
Gly 
420 



TAT CGG 
Tyr Arg 
390 

CCA TCC 
Pro Ser 
405 

GTG AAC 
Val Asn 



653 



' 701 



749 



797 



845 



893 



941 



989 



1037 



1085 



1133 



1181 



1229 



1277 



AAG CTG CCC TCC GTC AAC CAG CTG GTG GGC CAG CCT CCC CCG CAC AGC 



1421> 



1469 



57 



425 

440 

AAC CAC GGCXCAC 
Asn His 
455 



430 



445 



460 



480 



490 \ 495 

CGAGCAGTAT CGCATGACCA TC^GGCGGGG CCTGCAGGAC 
CGGCGCCGCC GCGCAGCAGC TGCT^fGCTC CAGCAACGCG 
CTCCGGGGAG CTGCAGCGCC AGCGGOTCAT GGAGGCCGTG 
CATCACCATC CCCAACCGCG GCGGCCCCGG CGCCGGCCCC 
CTTCGACCTG CCCGACTGCA AGGCCCGCAa\cAGCCCATC 
CGAGATCCAC TGAGGGGCCG GGCCCAGCCA GA)^CTGTGC 
CGCCTCGCTC TC 



Gin 


Pro 


Pro 
435 


Pro 


His 


Ser 




GGC 
Gly 


TCT 
Ser 
450 


GGG 


ATG 
Met. 


CTC 
Leu 


AAC 
Asn 


1517 


GAG 
Glu 
465 


ATG 
Met 


ACC 
Thr 


AGC 
Ser 


AGC 
Ser 


CAC 
His 
470 


15 65 


TGC 
Cys 


ACT 
Thr 


CCG 
Pro 


CCA 
Pro 


CCC 
Pro 
485 


CCC 
Pro 


1613 


TGG 
Trp 


GGG. 
Gly 


CCC 
Pro 


TGAAGATCCC 


1662 



CTGAAGCAGG 
GCCGCCATTT 
CACTTCCGCG 
GACGAGTGGG 
AAGGAGGAGT 
CACCGCCCAG 



gccacgac'ta 
ccatcggcgg 

TGCGCCACAC 

cggacttcgg 
tcacggaggc 
agacccaggc 



1722 
1782 
1342 
1902 
1962 
2022 
2034 




(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4; 

Met Ala Gin Ser Thr Thr Thr Ser Pro Asp Gly Gly Thr\Thr Phe Glu 
^ 5 10 \ 15 

His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr Phe As\ Leu Pro 
20 25 30^ 

Gin Ser Ser Arg Gly Asn Asn Glu Val Val Gly Gly Thr Asp S^r Ser 
35 40 45 

Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser Val Met Ala 
50 55 60 

Phe Asn Leu Leu Ser Ser Thr Met Asp Gin Met Ser Ser Arg Ala Al 

70 75 80 

Ser Ala Ser Pro Tyr Thr Pro Glu His Ala Ala Ser Val Pro Thr His 
85 90 95 

Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala 
100 105 110 

Pro Val lie Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu 
115 120 125 

Val Thr Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tvr 
130 135 140 
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Ser Pro Leu Leu Lya Lys Leu Tyr Cys Gin lie Ala Lys Th'- 
1-15 150 155 

lie Gin lie Lys Vai Ser Ala Pro Pro Pro Pro Gly Th- Ala 
165 170 

AiSa Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Asp He 

-30 135 , 190 

Arg Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly 
195 200 205 

Ala Pro Xla Ser His Leu He Arg Val Glu Gly Asn Asn Leu 
210 \ 215 220 

Tyr Val Asp Xsp Pro Val Thr Gly. Arg Gin Ser Val Val Val 
225 \ 230 235 

Glu Pro Pro Gin \ai Gly Thr Glu Phe Thr Thr He Leu Tyr 

250 

Met Cys Asn Ser Ser\:ys Val Gly Gly Met Asn Arg" Arg Pro 
260 \ 265 270 



lie He He Thr Leu Gli 
275 



Arg Asp Gly Gin Val Leu Gly 
280 285 




Ser Phe Glu Gly Arg He CysVAla Cys Pro Gly Arg Asp Ara 
290 295 \ 300 

Asp Glu Asp His Tyr Arg Glu GlXcin Ala Leu Asn Glu Ser 

310 \ 315 

Lys Asn Gly Ala Ala Ser Lys Arg Alk Phe Lys Gin Ser Pro 
.325 \330 

Val Pro Ala Leu Gly Pro Gly Val Lys Lyk Arg Arg His Gly 
340 345 \ 350 

.sp Thr Tyr Tyr Leu Gin Val Arg Gly Arg G^sl Asn Phe Glu 
355 360 \ 365 

Met Lys Leu Lys Glu Ser Leu Glu Leu Met Glu Lek Val Pro 
370 375 38 

Leu Val Asp Ser Tyr Arg Gin Gin Gin Gin Leu Leu G^K Ara 
335 390 .395 

His Leu Gin Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met 
405 410 



Cys Pro 
160 

He Arg- 
175 

Val Lys 

Gin Ser 

Ser Gin 

Pro Tyr 
240 

Asn Phe 
255 

He Leu 

Arg Arg 

Lys Ala 

Ser Ala 
320 

Pro Ala 
335 

Asp Glu 
He Leu 
Gin Pro 



Pro Ser 
400 

Asn Lys 
\415 



Val His Gly Gly Val Asn Lys Leu Pro Ser Val Asn Gin Leu 
420 425 430 



Gin Pro Pro Pro His Ser Ser Ala Ala Thr- Pro Asn Leu Gly 
435 440 445 

Gly Ser Gly Met Leu Asn Asn His Gly His Ala Val Pro Ala 

450 , 455 

Glu Met Thr Ser Ser His Gly Thr Gin Ser Met Val Ser Gly 
465 470 ^ 

Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val 
485 490 

Trp Gly Pro 



Gly 
Pro v^ 
Asn Ser 



Ser His 

480 

Arg Thr 
495 



(2) INFORMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 2156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 

<vi) ORIGINAL SOURCE: ' 

(A) ORGANISM: Homo sapiens 

(ix)\ FEATURE: 

(A) NAME/KEY: CDS 

[B) LOCATION: 33.. 1940 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GCGAGCTGCC CTCO^AGGCC GGCGTGGGGA AG ATG GCC CAG 

Met Ala Gin 
1 

TCC CCT GAT GGG GGOvACC ACGTTT GAG .C AC CTC TGG 
Ser Pro Asp Gly Gly Vhr Thr Phe Glu His Leu Trp 
10' \ IS 

■ CCA GAC AGC ACC TAG Tt\gAC CTT CCC CAG TCA AGC 
Pro Asp Ser Thr Tyr Phe Ysp Leu Pro Gin Ser Ser 
25 ^ 35 

■GAG GTG GTG GGC GGA ACG GATVTCC AGC ATG GAC GTC 
Glu Val Val Gly Gly Thr Asp \er Ser Met Asp Val 
"^0 45 \ 50 



TCC ACC GCC ACC 
Ser Thr Ala Thr 
5 

AGC TCT CTG GAA 
Ser Ser Leu Glu 
20 

CGG GGG AAT AAT 
Arg Gly Asn Asn 



TTC 
Phe 



GGC ATG ACT ACA TCT ■ GTC ATG GCC^ 
Gly Met Thr Thr Ser Val Met Ala 
60 



TTC AAT CTG 
[n Phe Asn Leu 
65 




ATG GAC CAG ATG AGC AGC CGC GCG GCC \cG GCC AGC 
Met Asp Gin Met Ser Ser Arg Ala Ala sWr Ala Ser 
75 80 

GAG CAC GCC GCC AGC GTG CCC ACC CAC TCG dcc TAC 
Glu His Ala Ala Ser Val Pro Thr His Ser pjp Tyr 
90 95 ^ 

TCC ACC TTC GAC ACC ATG TCG CCG GCG CCT GTC 
Ser Thr Phe Asp Thr Met Ser Pro Ala Pro Val Ilis 
105 110 115 

GAC TAC CCC GGA CCC CAC CAC TTT GAG GTC ACT TTC 
Asp Tyr Pro Gly Pro His His Phe Glu Val Thr Phe 
120 125 130 

ACG GCC AAG TCA GCC ACC TGG ACG TAC TCC CCG CTC 
Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Pro Leu 
140 145 

TAC TGC CAG ATC GCC AAG ACA TGC CCC ATC CAG ATC 
Tyr Cys Gin He Ala Lys Thr Cys Pro He Gin He 
155 160 

CCG CCA CCC CCA GGC ACT GCC ATC CGG GCC ATG CCT 
Pro Pro Pro Pro Gly Thr Ala He Arg Ala Met Pro 

170 175 , 

GCG GAG CAC GTG ACC GAC * GTC GTG AAA CGC TGC CCC 
Ala- Glu His Val Thr Asp Val Val Lys Arg Cys Pro 
135 190 195 

GGG AGG GAC TTC AAC GAA GGA CAG TCT GCT CCA GCC 
Gly Arg Asp Phe Asn Glu Gly Gin Ser Ala Pro Ala 
200 205 210 



CTG 
Leu 



CCC 
Pro 



GCA 
Ala 
100 

CCC 

Pro 



CAC 
His 



AGC 
Ser 



TAC 
Tyr 
85 

CAA 
Gin 



CTG GAG 
Leu Glu 
55 

AGC ACC 
Ser Thr 
70 

ACC CCA 
Thr Pro 



CCC AGC 
Pro Ser 



TCC AAC ACC 
Ser Asn Thr 



CAGvCAG 
Gln\ln 



TTG AAG 
Leu Lys 



AAG GTG 
Lys Val 
165 

GTT TAC 
Val Tyr 
180 

AAC CAC 
Asn His 



TCC AGC 
Ser Ser 
135 

CTC 
Leu 



TCC 
Ser 



AAG AAA 
Lys Lys 



GAG CTC 
Glu Leu 



53 



101 



149 



197 



245 



293 



341 



389 



437 



485 



533 



AGC CAC CTC ATC 
Ser His L u He 
215 



677 



60 



CGC GTG 
Arg Val 



GGC AGG 
Gly .Arg 



GAA TTC 
Glu Phe 



GGG GGC 
Gly Gly 
265 

CGG GAT 
Arg Asp 
230 

GCC TGT 
Aia Cys 



GAA 
Glu 



CAG 
Gin 



Tf 

2i 

ATG 
Met 



GGC AAT AAT 
Gly Asn Asn 
220 

AGC GTC GTG 
Ser Val Val 

235 

ACC ATC CTG 
Thr He Leu 



CTC TCG 
Leu Ser 



GTG CCC 
Val Pro 



CGG CGG 
Asi^Arg Arg 



GGG CAG GTGvCTG 
Gly Gin Val 



TAG AAC 
Tyr Asn 
255 

CCC ATC 
Pro lie 
270 

GGC CGC 
Gly Arg 



CAG TAT 
Gin Tyr 
225 

TAT GAG 
Tyr Glu 
240 

TTC ATG 
Phe Met 



GTG GAT GAC 
Val Asp Asp 



CCA CCA CAG 
Pro Pro Gin 



CTC ATC 
Leu lie 



CGG TCC 
Arg Ser 



CCT GGC 
Pro Gly 



CGC GAC 
Arg Asp 
300 



AAA 
Lys 




CAG CAG 
Gin Gin 



CGT GCC 
Arg Ala 



GTG AAG 
Val Lys 
345 

CGA GGC 
Arg Gly 
360 



GCC 
Ala 



TTC 
Phe 
330 

AAG 
Lys 



CTG 
Leu 
315 

AAG 
Lys 



CGG 
Arg 



AAC GAG AGC t\c 
Asn Glu Ser Se, 



CAG AGC 
Gin Ser 



CGG CAT 
Arg His 



CGG GAG 
Arg Glu 



GAG CTG 
lu Leu 



CAG CAG 
Gin Gin 



GGG CCG 
Gly Pro 



CTG CCC 
Leu Pro 
4 25 

GCA GCT 
Ala Ala 
440 

CAT GGC 
His Gly 



ATG GAG 
Met Glu 



CAG CTC 
Gin Leu 
395 

GTC CTC 
Val Leu 
410 

TCC GTC 
Ser Val 



AAC TTT 
Asn Phe 
365 

TTG GTG 
Leu Val 
380 

CTA CAG 
Leu Gin 



CCC CCT 
Pro Pro 
335 

GGA GAC 
Gly Asp 
350 

GAG ATC 
Glu lie 



GCT- GAT 
Ala Asp 

305 

GCC AAG 
Ala Lys 
20 

GTC 
AlAval 



GAG GA? ^ 
Glu Asp' 



TGT AAC AGC 
Cys Asn Ser 
260 

ATC ATC ACC 
He lie Thr 
275 

TTT GAG GGC 
Phe Glu Gly 
290 

GAG GAC CAC 
Glu Asp His 



CCT GTC ACC 
Pro Val Thr 
230 

GTG GGG ACG 
Val Gly Thr 
245 

AGC TGT GTA 
Ser Cys Val 



CTG GAG ATG 
Leu Glu Met 



CGC ATC 
Arg He 



AAC GGG GCC 
Asn Gly Ala 



CCC 
Pro 



ACG 



CTG ATG 
Leu Met 



CCG CAG 
Pro Gin 



AGG CCG 
Arg Pro 



TCG 
Ser 



CCC ATG 
Pro Met 



ACA CCC 
Thr Pro 



CAC GCA 
His Ala 



GCC CAG 
Ala Gin 



CAC GCC 
His Aia 



TCC ATG 
Ser Met 
475 

GAC CCC 
Asp Pro 
490 



AAC 
Asn 



AAC 
Asn 



GTG 

Val' 
460 

GTC 
Val 



CAG 
Gin 



CTG 
Leu 

.445 

CCA 
Pro 



CTG 
Leu 
430 

GGG 
Gly 



GCC 
Ala 



AAC 
Asn 
415 

GTG 
Val 



CCC 
Pro 



AAC 
Asn 



CCA CTG 
Pro Leu 
385 

AGT CAC 
Ser His 
400 

AAG GTG 
Lys Val 



AAG^ 

Lys 

370 

GTG 
Val 



GCC CTT 
Ala Leu 
340 

TAC TAC 
Tyr Tyr 
355 

vCTG AAA 
»^u Lys 



TAC CGG 
Tyr Arg 
310 

GCC AGC 
Ala Ser 
325 

GGT GCC 
Gly Ala 



TGC 
Cys 
295 

GAG 
Glu 



AAG 
Lys 



GGT 
Gly 



CTT CAG GTG 
Leu Gin Val 



GAG AGC 
Glu Ser 



GAC ^C 
Asp sVr 



CTA CAG CCC 
Leu Gin Pro 



GGC CAG 
Gly Gin 

GTG GGC 
Val Gly 



GGC 
Gly 



TCG GGG TCC 
Ser Gly Ser 



AGC CTC GTC AGT 
Ser Leu Val Ser 
495 



CAC 
His 
480 

TTT 
Phe 



GAG 
Glu 
465 

TGC 
Cys. 



TTA 
Leu 



CAC GGG GGC 
His Gly Gly 
420 

CCT CCC CCG 
Pro Pro Pro 
435 

CCC GGG ATG 
Pro Gly Met 
450 

ATG AGC AGC 
Met Ser Ser 



TAT CGG 
Tyr Arg 

390 

;G TCC 
Pr\ Ser 
405 

ATG aA 
Met Asn 



CTG 
Leu 
375 

CAG 

Gin* 



TAC 
Tyr 

AAG 
sXys 



CAC AGT TCC 
His Ser Ser 



CTC AAC 
Leu Asn 



ACT CCG CCA 
Thr Pro Pro 



ACA GGA TTG 
Thr Gly Leu 
500 



AGC CAC 
Ser His 
470 

CCC CCC 
Pro Pro 
485 

GGG TGT 
Gly Cys 



AAC 
Asn 
455 

AGC 
Ser 



TAC 
Tyr 



CCA 
Pro 



725 



773 



821 



869 



917 



965 



1013 



1061 



1109 



1157 



1205 



1253 



1301 



1349- 



1445\ 



1493 



1541 



61 



AAC TGC ATC GAG TAT TTC ACC TCC CAA GGG TTA CAG AGC ATT TAC CAC 

Asn Cys He Giu Tyr Phe Thr Ser Gin Gly Leu Gin Ser lie Tyr His 
505 510 515 

'"•r, AAC CTG ACC ATT GAG GAC CTG GGG GCC CTG AAG ATC CCC GAG 

Leu GiSi Asn Leu Thr He Glu Asp Leu Gly Ala Leu Lys He Pro Glu 



52 0 



525 



530 



535 



CAG TAC cd^ATG ACC ATC TGG CGG GGC CTG CAG GAC CTG AAG CAG GGC 

Gin Tyr ArgNMet Thr He Trp Arg Gly Leu Gin Asp Leu Lys Gin Gly 
^ 540 545' 550 

CAC GAC TAC AGcNaCC GCG CAG CAG CTG CTC CGC TCT AGC AAC GCG GCC 

His Asp Tyr Ser t^hr Ala Gin Gin Leu Leu Arg Ser Ser Asn Ala Ala 
555 \ 560 



ACC ATC TCC ATC GGC 
Thr He Ser He Gly g\ 
570 



565 

TCA GGG GAA CTG CAG CGC CAG CGG GTC ATG 
Ser Gly Glu Leu Gin Arg Gin Arg Val Met 

575 • 580 



GAG GCC GTG CAC TTC CGC d^G CGC CAC ACC ATC ACC ATC CCC AAC CGC 

Glu Ala Val His Phe Arg Vi^ Arg His Thr He Thr He Pro Asn Arg 
585 590V . 595 

GGC GGC CCA GGC GGC GGC CCT OAC GAG TGG GCG GAC TTC GGC TTC GAC 

Gly Gly Pro Gly Gly Gly Pro Alo Glu Trp Ala Asp Phe Gly Phe Asp 

600 605 \ 610 615 

CTG CCC GAC TGC AAG GCC CGC AAG CSAG CCC ATC AAG GAG GAG TTC ACG 

Leu Pro Asp Cys Lys Ala Arg Lys o\n Pro He Lys Glu Glu Phe Thr 

620 \ 625 630 




GAG GCC GAG ATC CAC TGAGGGCCTC GCCTC 
Glu Ala Glu He His 
635 



:tgc agcctgcgcc accgcccaga 



GACCCAAGCT GCCTCCCCTC TCCTTCCTGT GTGTCCAAaA CTGCCTCAGG AGGCAGGACC 

ttcgggctgt gcccggggaa aggcaaggtc cggcccatcVccaggcacct CACAGGCCCC 

AGGAAAGGCC CAGCCACCGA AGCCGCCTGT GGACAGCCTG AfeTCACCTGC AGAACC 



1589 

1637 

1635 

1733 

1731 

1329 

1877 

1925 

1980 

2040 
2100 
2156 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 amino acids 

(B) TYPE: amino acid 
(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Gin Ser Thr Ala Thr Ser Pro Asp Gly Gly Thr Thr Phe ^ 

•1 ' 5 10 15 

His Leu Trp Ser Ser Leu Glu Pro Asp Ser Thr Tyr Phe Asp Leu Pro 
20 25 30 

Gin Ser Ser Arg Gly Asn Asn Glu Val Val Gly Gly Thr Asp Ser Ser 
35 40 45 

Met Asp Val Phe His Leu Glu Gly Met Thr Thr Ser Val Met Ala Gin 
50 55 60 

Phe Asn Leu Leu Ser Ser Thr Met Asp Gin Met Ser Ser Arg Ala Ala 

70 75 SO 

Ser Ala Ser Pro Tyr Thr Pro Glu His Ala Ala Ser Val Pro Thr His 
85 90 95 
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Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala 

100 105 110 

Pro Val He Pro Ser Asn Thr Asp Tyr Pro Gly Pro His His Phe Glu 

115 120 125 



Val l: 

i: 



Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr 
135 .140 




Ser Pro L\u Leu Lys Lys Leu Tyr Cys Gin He Ala Lys Thr Cys Pro 
145 \ 150 155 160 

He Gin He iws Val Ser Thr Pro' Pro Pro Pro Gly Thr Ala He Arg 
165 170 175 

Ala Met Pro Val \yr Lys Lys Ala Glu His Val Thr Asp Val Val Lys 
180 \ 185 190 

Arg Cys Pro Asn HIsNgIu Leu Gly Arg Asp Phe Asn Glu Gly Gin Ser 
195 \ 200 205 

Ala Pro Ala Ser His LeiXlle Arg Val Glu Gly Asn Asn Leu Ser Gin 
210 XlS 220 

Tyr Val Asp Asp Pro Val Th\Gly Arg Gin Ser Val Val Val Pro Tvr 
225 230 \ 235 240 

Glu Pro Pro Gin Val Gly Thr g\ Phe Thr Thr He Leu Tyr Asn Phe 
245 \ 250 255 

Met Cys Asn Ser Ser Cys Val Gly d\y Met Asn Arg Arg Pro He Leu 
260 26^ 270 

He .He He Thr Leu Glu Met Arg Asp CSly Gin Val Leu Gly Arg Arg 
275 280 \ 285 

Ser Phe Glu Gly Arg He Cys Ala Cys ProSGly Arg Asp Arg Lys Ala 
290 295 \ 300 

Asp Glu Asp His Tyr Arg Glu Gin Gin Ala Leik Asn Glu Ser Ser Ala 
305 310 315\ 320 

Lys Asn Gly Ala Ala Ser Lys Arg Ala Phe Lya Glfa Ser Pro Pro Ala 
325 330 A 335 

Val Pro Ala Leu Gly Ala Gly Val Lys Lys Arg Arg H^s Gly Asp Glu 
340 345 \350 

Asp Thr Tyr Tyr Leu Gin Val Arg Gly Arg Glu Asn Phe ^u He Leu 
355 360 365 

Met Lys Leu Lys Glu Ser Leu Glu Leu Met Glu Leu Val Pro\ln Pro 
370 375 380 

Leu Val Asp Ser Tyr Arg Gin Gin Gin Gin Leu Leu Gin Arg Pro\er 

390 395 ^^p 

His Leu Gin Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met Asn Lys^ 
405 410 43^5 

Val His Gly Gly Met Asn Lys Leu Pro Ser Val Asn Gin Leu Val Gly 
420 425 430 

Gin Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly Pro Val 
435 440 445 

Ir^ ^-^^ Ala val Pro Ala Asn Gly 

450 455 460 

Glu Met Ser Ser Ser His Ser Ala Gin Ser Met Val Ser Gly Ser His 



470 



475 



480 
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Cys Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser Phe 
485 490 495 

-eiXlhr Gly Leu Gly Cys Pro Asn Cys He Glu Tyr Phe Thr Ser Gin 
500 505 510 

Gly Le\i Gin Ser He Tyr His Leu Gin Asn Leu Thr lie Glu Asp Leu 
^.5 15 520 525 

Gly Ala ^u Lys lie Pro Glu Gin Tyr Arg Mec Thr He Trp Ara Glv 
530 \ 535 540 - ^ ^ 

Leu Gin Asp^eu Lys Gin Gly His Asp Tyr Ser Thr Ala Gin Gin Leu 
545 .\ 550 555 560 

Leu Arg Ser SeV Asn Ala Ala Thr lie Ser He Gly Gly Ser Gly Glu 
\565 570 575 

Leu Gin Arg Gin ;C!rg Val 'Met Glu Ala Val His Phe Arg Val Arg His 
580 \ 585 590 

Thr He Thr He Pro Vsn Arg Gly Gly Pro Gly Gly Gly Pro Asp Glu 
•595 \ 600 605 

Trp Ala Asp Phe Gly PheVsp Leu Pro Asp Cys Lys Ala Arg Lys Gin 
610 eas 620 

Pro He Lys Glu Glu .Phe ThX^Glu Ala Glu He His 
625 630 \ 635 

(2) INFORMATION FOR SEQ ID NO: 

(i) .SEQUENCE CHARACTERISTICS^ 

(A) LENGTH: 2040 base paJS^s 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus, 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 124.. 1890 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7; 

TGATCTCCCT GTGGCCTGCA GGGGACTGAG CCAGGGAGTA 

GACACCCAAG GAAACCTTGC TGGCTTTGAG AAAGGGATCG 

AGO ATG TGT ATG GGC OCT GTG TAT GAA TCC TTG 
Met Cys Met Gly Pro Val Tyr Glu Ser Leu 
1 5 10 

AAT TTG CTC AGC AGT GCC ATG GAC CAG ATG GGC 
Asn Leu Leu Ser Ser Ala Met Asp Gin Met Gly 
20 25 

GCG AGC CCC TAC ACC CCG GAG CAC GCC GCC AGC 
Ala Ser Pro Tyr Thr Pro Glu His Ala Ala Ser 
35 40 

CCC ■ TAC GCG CAG CCC AGC TCC ACC TTC GAC ACC 
Pro Tyr Ala Gin Pro Ser Ser Thr Ph Asp Thr 
50 55 



GATGCCCTGA GA< 

TCTCTCTCCT GCCC 

GCG CAG GCC CAG 
Gly Gin Ala Gin 



CCAAGG 



TTC 
Phe> 



AGC CGT 
Ser Arg 



GCG CCC 
Ala Pro 



ATG TCT 
M t Ser 
60 



GCG GCC 
Ala Ala 
30 

ACC CAC 
Thr His 
45 

CCG GCG 
Pro Ala 



GTC ATC CCT TCC AAT ACC GAC TAC CCC GGC CCC CAC CAC TTC GAG 



15 

CCG 
Pro 



TCG 
Ser 



CCT 
Pro 



GTC 



60 
120 
163 



312\ 



360 



64 



Val He 
65 

ACC TTC 
Thr ?he 

SO" 

CCA 
Pro LeuS 



Pro Ser Asn Thr 



CAG GAG 
Gin Gin 



TTG .AAG 
.eu Lys 



CAG ATC 
Gin lie 



ATG CCT 
Met Pro 



TGC CCC 

■ Cys Pro 
145 

CCG GCT 
Pro Ala 

160 

GTG GAT 
Val Asp 



GTG 
Lys^al 
dS 

GTC TAC 
Val Tyr' 
130 

AAC CAC 
Asn His 



TCG AGC 
Ser Ser 
35 

AAG TTG 
Lys Leu 
100 

TCG ACA 
Ser Thr 



Asp Tyr 
70 

ACT GCC 
Thr Ala 



Pro Gly Pro 



AAG TCG 
Lys Ser 



TAC TGT 
Tyr Cys 



CCA CCA 
Pro Pro 



AAG AAG 
.ys Lys 

GAGVCTT 
Glu \eu 



AGC CAG 
Ser His 



GAC CCT 
Asp Pro 



CCC CCA 
Pro Pro 



TGT AAC 
Cys Asn 



CAG GTG 
Gin Val 
195 

AGC AGC 
Ser Ser 
210 



GTC ATG^ 
Leu He 
165 

GTC ACG 
Val Thr 
180 

GGA ACA 
Gly Thr 



GCA GAG 
Ala Glu 
135 

GGA AGG 
Gly Arg 

150 

:gt gta 

Val 



CAG ATT 
Gin He 
105 

CCC CCG 
Pro Pro 
120 

GAT GTG 
His Val 



GCC 
Ala 
90 

GCT 
Ala 



His His 
75 

ACC TGG 
Thr Trp 



Phe Glu Val 



ACA TAC 
Thr Tyr 



AAG ACA 
Lys Thr 



GGC 
Gly 



ACG GCC 
Thr Ala 



ACC 
Thr 



GAC TTC AAT 
Asp Phe Asn 



GAA GGC 
Glu Gly 



GGA XgG 
Gly 



GAA TTT 
Glu Phe 



CAG AGT 
Gin Ser 
185 

P\G AGC 
ThX Thr 

200 



TGT GTG 
Cys Val 



GGG GGC 
Gly Gly 
215 



ATG 
Met Asn^ 



AA2 



AAC 
Asn 

170 

GTG 
Val 



ATG 
He 



CGG 
.Arg 



GAG ATT 
Asp He 

140 

GAA GGA 
Glu Gly 

15.5 

AAC GTG 
Asn Leu 



TGC GCC 
Cys Pro 

110- 

ATC CGG 
He Arg 
125 

GTT AAG 
Val Lys 



TGC 
Ser 
95 

ATC 
He 



GCC 
Ala 



GGC 
Arg 



CAG TCT GCC 
Gin Ser Ala 



GGC CAG 
Ala Gin 



GTT GTG 
Val Val 



GTG TAC 
Leu Tyr 



AGG CCC 
Arg Pro 

220 



GGG TAT 
Pro Tyr 

190 

AAC TTC 
Asn Phe 
205 

ATC CTT 
He Leu 



TAC . 

Tyr 

175 

GAA 
Glu 



ATG 
Met 



GTC 
Val 




TTC GAG 
Phe Glu 
240 

GAA GAC 
Glu Asp 



ACC GTG 
Thr Leu 



GGT GGC 
Gly Arg 

GAT TAC 
His Tyr 



AAT GGA 
Asn Gly 



CCT. GCC 
Pro Ala 



ATG TTG 
Met Phe 
305 

AAA GTG 
Lys Val 
320 

GTT GAC 
Val Asp 



GCT GCC 
Ala Ala 
275 

GTG GGT 
Leu Gly 
290 

TAC ATG 
Tyr Met 



GAG ACC 
Glu Thr 



ATC TGT 
He Cys 
245 

GGG GAG 
Arg Glu 
260 

AGC AAA 
Ser Lys 



CGG GAT 
Arg Asp 

230 

GCC TGT 
Ala Cys 



GAA CAG 
Gin Gin 



GGA GAG GT! 
Gly Gin Val^ 



CCT 
Pro 



GCT 
Ala 



GGT GCA 
Arg Ala 



ACC AAC 
Thr Asn 



CAC GTG 
His Val 



AAG GAG 
.Lys Glu 



TGC TAT 
Ser Tyr 



AGC GTA 
Ser Leu 

325 

CGA CAG 
Arg Gin 
340 



GTG AAG 
Val Lys 
295 

CGA GGC 
Arg Gly 
310 

GAA GTG 
Glu Leu 



GAG GAG 
Gin Gin 



TTC 
Phe 
280 

AAG 



GGC CGT 
Gly Arg 
250 

CTG AAT 
Leu Asn 

265 

AAG CAG 
Lys Gin 



CTG GGC 
.eu Gly 
5 

CGG 



GGC CGG TCT 
Arg Arg Ser 



GAA AGT 
Glu Ser 



AGC CCC 
Ser Pro 



AGA GGC 
Arg Arg 



CGG GAG AAC 
Arg Glu Asn 



ATG GAG CTT 
Met Glu L u 
330 

CAG CAG GAG 
Gin Gin Gin 
345 



CAC GGG 
His Gly 
300 

TTT • GAG 
Phe Glu 
3 IS 

GTG GCC 
Val Pro 



AAA GCT 
Lys Ala 



,ACC ACC 
:hr Thr 
270 

cctNgcc 

Pro Xla 
285 

GAC GAG^ 
Asp Glu 



GAT 
Asp 

255 

AAA 
Lys 



ATC 
He 



rAC 

Asp 



ATC TTG ATG 
He Leu Met 



CTG CTA 
Leu Leu 



GAG CCT TTG 
Gin Pro Leu 
335 

GAG AGG CGG 
Gin Arg Pro 
350 



408 



456 



504 



552 



600 



648 



696 



744 



792 



840 



388 



936 



984 



1032 



AGT CAC GTG CAG CGT CCA. TGC TAT GGG CCC GTG GTC TCG CCA ATG AAC 



1176 



1224 
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Ser His Leu Gin Pro Pro Ser Tyr Gly Pro Val 
355 360 

.^aiG GTA CAC GOT GGT GTC AAC AAA CTG CCC TCC 
Lvs Val His Gly Gly Val Asn Lys Leu Pro Ser 
^ 370 375 

-GGC c\g OCT CCC CCG CAC AGC TCA GCA GCT GGG 
Gly Gi;\?ro Pro Pro His Ser Ser Ala Ala Gly 
385 \ 390 

ATG GGC TOC GGG ATG CTC AAC AGC CAC GGC CAC 
Met Gly Ser\Gly Met Leu Asn Ser His Gly His 
400 \ -405 410 

GGT GAG ATG aA GGA GGC CAC AGC TCC CAG ACC 
Gly Glu Met Asn^Gly Gly His Ser Ser Gin Thr 
\20 425 

CAC TGC ACC CCG CcX CCC CCC TAT CAT GCA GAG 
His Cys Thr Pro Pro\Pro Pro Tyr His Ala Asp 
435 \ 440 

TTT TTG ACA GGG TTG GGOv-TGT CCA AAC TGC ATC 
?he Leu Thr Gly Leu GlyVys Pro Asn Cys lie 
450 \ 455 



Leu Ser 



GTC AAC 
Val Asn 
380 

CCC. AAC 
Pro Asn 
395 

AGC ATG 
Ser Met 



Pro Met Asn 

365 

CAG CTG GTG 
Gin Leu Val 



CTG GGG CCC 
Leu Gly Pro 



CCG 
Pro 



ATG GTT 
Met Val 



CCC AGC 
Pro Ser 




CAA GGG TTG CAG AGC ATC TAcScAC CTG CAG AAC 
Gin Gly Leu Gin Ser lie Tyr ^s Leu Gin Asn 

465 , 470 

CTT GGG GCT CTG AAG GTC CCT GAc\aG TAG CGT 
Leu Gly Ala Leu Lys Val Pro Asp Gin Tyr Arg 
480 485 \ 490 

GGC CTA CAG GAC CTG AAG CAG AGC CAT ^C TGC 
Gly Leu Gin Asp Leu Lys Gin Ser His Asb Cys 

500 50? 

CGC TCC AGC AGC AAC GCG GCC ACC ATC TCC Al^ 
Arg Ser Ser Ser Asn Ala Ala Thr He Ser Ile^ 
515 520 

CTG CAG CGG CAG CGG GTC ATG GAA GCC GTG CAT 
Leu Gin Arg Gin Arg Val Met Glu Ala Val His 
530 535 

ACC ATC ACA ATC CCC AAC CGT GGA GGC GCA GGT 
Thr lie Thr He Pro Asn Arg Gly Gly Ala Gly 
545 550 

GAC GAG TGG GCG GAC TTT GGC TTT GAC CTG CCT 
Asp Glu Trp Ala Asp Phe Gly Phe Asp Leu Pro 

560 565 

AAG CAG CCC ATC AAA GAG GAG TTC ACA GAG ACA 
Lys Gin Pro He Lys Glu Glu Phe Thr Glu Thr 
580 585 

TGAGGAACGT ACCTTCTTCT CCTGTCCTTC CTCTGTGAGA 

CCTGTTGGCT GTGCCCACAG AAACCAGCAA GGACCTTCTG 

GAAGTCGCTC ATGAACTAAC TCCCTCTTGG 



GAG TGC 
Glu Cys 
460 

CTT ACC 
Leu Thr 
475 

ATG ACC 
Met Thr 



GGC CAG 
Gly Gin 



TCG 
Ser 



CTC 
Leu 
445 

TTC 
Phe 



GCC- AAT 
Ala Asn 
415 

GGA TCC 
Gly Ser 
430 

GTC AGT 
Val Ser 



ACT TCC 
Thr Ser 



ATC GAG GAC 
He Glu Asp 



ATC 
He 



CAA 
Gin 



GGC GGC 
.Gly Gly 



TTC 

Phe Urg 

GCG GTG 
Ala Val 
555 

GAC TGC 
Asp Cys 



TCT 
Ser 
525 

GTG 
Val 



TGG AGG 
Trp Arg 
495 

CTG CTA 
.Leu Leu 
510 

GGC GAG 
Gly Glu 



CGC CAC 
Arg His 



^CA GGT CCC 
Gly Pro 



GAG AGC 
Glu Ser 



AAG TCC CGT 
Lys Se^ Arg 
,575 

CAC 
His 



AACTGCTCTT GGAAGTGGGA 
CCGGATGCCA TTCCTGAAGG 



1272 

1320 

1363 

1416 

1464 

1512 

1560 

1608 

1656 

1704 

1752 

1300 

1348 

1890 

1950 
)10 



(2) INFORMATION FOR SEQ ID' NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 589 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(li) MOLECULE TYPE: protein 

(xi) . SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

y.et Cys Met Gly Pro Val Tyr Glu Ser Leu Gly Gin Aia 
5 10 

Leu LeNii Ser Ser Ala Met Asp Gin" Met Gly Ser Arg' Aia 
20 25 

Ser Pro tV Thr Pro Glu His Ala Ala Ser Ala 
33^ 40 

Tyr Ala Gin ^o Ser Ser Thr Phe Asp Thr Met 
50 \ 55 



Pro Thr 
45 

Ser Pro 

60 



Gin. Phe Asn 
15 

Aia Pro Aia 
His Ser Pro 




lie Pro Ser AsnVhr Asp Tyr Pro Gly Pro His 
65 \ 70 75 

Phe Gin Gin Ser Se\ thr Ala Lys Ser Ala Thr 
35\ 90 

Leu Leu Lys Lys Leu tVt Cys Gin lie Ala Lys 
100 \ 105 

^le Lys Val Ser Thr Pro \ro Pro Pro Gly Thr 
115 \ 120 

Pro Val Tyr Lys Lys Ala GlAHis Val Thr Asp 
130 135 ^ 

Pro Asn His Glu Leu Gly Arg AsW Phe Asn Glu 

150 \ 155 

Ala Ser His Leu lie Arg Val Glu G^ Asn Asn 
165 \ 170 

Asp Asp Pro Val Thr Gly Arg Gin Ser Vkl Val 
180 185 

Pro Gin Val Gly Thr Glu Phe Thr Thr He ^u 
195 200 ^ 

Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg 

210 215 

He Thr Leu Glu Thr Arg Asp Gly Gin Val Leu 
225 230 235 

Glu Gly Arg He Cys Ala Cys Pro Gly . Arg Asp 
245 250 

Asp His Tyr Arg Glu Gin Gin Ala Leu Asn Glu 
260 265 

Gly Ala Ala Ser Lys Arg Ala Phe Lys Gin Ser 
275 280 

Ala Leu Gly Thr Asn Val Lys Lys Arg Arg His 

290 295 

Phe Tyr Met His Val Arg Gly Arg Glu Asn Phe 
305 310 21S 

Val Lys Glu Ser Leu Glu Leu Met Glu Leu Val 
325 330 

Asp Ser Tyr Arg Gin Gin Gin Gin Gin Gin Leu 
340 . 345 

His Leu Gin Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Met 



His Phe 

Trp Thr 

Thr Cys 

Ala He 
125 

He Val 
140 

Gly Gin 
Leu Ala 
Val Pro 



Tyr Asn 
205 

ro He 



Ala Pro Val 

Glu Val Thr 
80 

Tyr Ser Pro 
95 

Pro He Gin 
110 

Arg Ala Met 

Lys Arg Cys 

Ser Ala Pro 
160 

Gin Tyr Val 
175 

Tyr Glu Pro 
190 

Phe Met Cys 
Leu Val He 



Gly J\rg Arg 

Arg Lys^i^a 

Ser Thr Thr 
270 

Pro Pro Ala 
285 

Gly Asp Glu 

300 

Glu He Leu 
Pro Gin Pro 



Leu Gin Arg 
350 



Ser Phe 
240 

Asp Glu 

255 

.ys Asn 

Ile\pro 

Asp Met 

Met Lys 
320 

Leu Val 
335 

Pro Ser 
Asn Lys 
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^{ii) MOLECULE TYPE: protein 

ci) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

it CyAMet Gly Pro Val Tyr Glu Ser Leu Giy Gin Ala Gin Phe Asn 
1 \ 5 . 10 15 

-eu Leu 5^ Sef Ala Met Asp Gin Met Gly Ser^Arg Ala Ala Pro Ala 
20 25 30 

Ser Pro Tyr.^r Pro Glu His Ala Ala Ser Ala Pro Thr His Ser Pro 
35 \ 40 45 

Tyr Ala Gin ProNser Ser Thr Phe Asp Thr Met Ser Pro Ala Pro Val 

50 \ 55 ■ ■ 60 . 

lie Pro Ser Asn ThX Asp Tyr Pro Gly Pro His His Phe Glu Val Thr 
65 \ 70 75 80 



Phe Gin Gin Ser Ser 
85 



Ala Lys Ser Ala Thr Trp Thr Tyr Ser Pro 

90 . 95 



Leu Leu Lys Lys Leu .Tyr Ovs Gin He Ala Lys Thr Cys- Pro He Gin 
100 \ 105 110 

He Lys Val Ser Thr Pro Pro\ro Pro Gly Thr Ala He Arg Ala Met 
115 l\o 125 

Pro Val Tyr Lys Lys Ala Glu HisXval Thr Asp He Val Lys Arg Cvs 
130 135 \ 140" 

Pro Asn His Glu Leu Gly Arg Asp Ph\Asn Glu Gly Gin Ser Ala Pro 

1^*5 150 \ 155 . 160 



Ala Ser His Leu He Arg Val Glu Gly 
165 



Asn Leu Ala Gin Tyr Val 
175 




Asp Asp Pro Val Thr Gly Arg Gin Ser Val Vial Val Pro Tyr Glu Pro 
130 185 \ 190 

Pro Gin Val Gly Thr Glu Phe Thr Thr He Leu \z Asn Phe Met Cys 
195 200 \ 205 

Asn Ser Ser Gys Val Gly Gly Met Asn Arg Arg Pro\le Leu Val He 
210 215 220 

He Thr Leu Glu Thr Arg Asp Gly Gin Val Leu Gly ArgWg Ser Phe 

230 235 V 240 

Glu Gly Arg He Cys Ala Cys Pro Gly Arg Asp Arg Lys AiYasp Glu' 
245 250 A255 

Asp His Tyr Arg Glu Gin Gin Ala Leu Asn Glu Ser Thr Thr l\ Asn 
260 265 270 

Gly Ala. Ala Ser Lys Arg Ala Phe Lys Gin Ser Pro Pro Ala He PYo 

275 280 .285 

Ala Leu Gly Thr Asn Val Lys Lys Arg Arg His Gly Asp Glu Asp Met 
290 295 300 

Phe Tyr Met His Val Arg Gly Arg Glu Asn Phe Glu He Leu Met Lys 



315 



320 



Val Lys Glu Ser Leu Glu Leu Met Glu Leu Val Pro Gin Pro Leu Val 
^25 330 . 335 

Asp Ser Tyr Arg Gin Gin Gin Gin Gin Gin Leu Leu Gin Arg Pro Ser 

345 350 

His Leu Gin Pro Pro Ser Tyr Gly Pro Val Leu S r Pro Met Asn Lys 
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355 



360 



365 



val His Gly Gly Val Asn Lys Leu Pro Ser Vai Asn Gin Leu 
370 375 380 

^InSPro Pro Pro His Ser Ser Ala Ala Gly Pro Asn Leu Glv 
335 \ 390 395 

Gly Ser\Gly Met Leu Asn Ser His Gly His Ser Met Pro Ala 
405 410 




Glu Met AsnVGiy Gly His Ser Ser Gin Thr Met Val Ser Gly 
X20 425 430 

Cys Thr Pro ?r\ Pro Pro Tyr His Ala Asp Pro Ser Leu Val 
435 \ . . 440 445 

Leu Thr Gly Leu g\ Cys Pro Asn Cys He Glu Cys Phe Thr 
450 \ 455 ■ 460 

Gly Leu Gin Ser He rVr His Leu Gin Asn Leu Thr He Glu 
465 47ISI 47S; 

Gly Ala Leu Lys Val Pro Asp Gin Tyr Arg Met Thr lie Trp 
485 \ 490 

Leu Gin Asp Leu Lys Gin Ser\i3 .Asp Cys Gly Gin Gin Leu 
500 \ 505 510 

Ser Ser Ser Asn Ala Ala Thr Ilkser He Gly Gly Ser Gly 
515 520\ 525 

Gin Arg Gin Arg Val Met Glu Ala vi»i His Phe Arg Val Ara 

S3Q . ■ . - 535 ■ \ -540 

He Thr He Pro Asn Arg Gly Gly Ala ^y Ala Val Thr Glv 
545 550 \ 555 

Glu Trp Ala Asp Phe Gly Phe Asp Leu Pro X^p Cys Lys Ser 
565 570 

Gin Pro .He Lys Glu Glu Phe Thr Glu Thr Glu\er His 

580 585. 

(2) INFGRMATIOW FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 758 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mu3 musculus 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 389.. 757 



Val Gly 

Pro Met 
4 00 

Asn Gly 
415 

Ser His 

Ser Phe 

Ser Gin 

Asp Leu 
480 

Arg Gly 
495 

Leu Arg 
Glu Leu 
His Thr 



Pro Asp 
560 

Arg Lys 
575 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

TGGTCCCGCT TCGACCAAGA CTCCGGCTAC CAGCTTGCGG GCCCCGCGGA GGAGGAGACC 6^ 

CCGCTGGGGC TAGCTGGGCG ACGCGCGCCA AGCGGCGGCG GGAAGGAGGC GGGAGGAGCG 120 

GGGCCCGAGA CCCCGACTCG GGCAGAGCCA GCTGGGGAGG CGGGGCGCGC GTGGGAGCCA 130 

GGGGCCCGGG TGGCCGGCCC TCCTCCGCCA CGGCTGAGTG CCCGCGCTGC CTTCCCGCCG 24 0 



68 



GTCCGCCAAG AAAGGCGCTA AGCCTGCGGC AGTCCCCTCG CCGCCGCCTC CCTGCTCCGC 300 

ACCCTTATAA CCCGCCGTCC CGCATCCAGG CGAGGAGGCA ACGCTGCAGC CCAGCCCTCG 3 60 

CCGACGCCGA CGCCCGGCCC GGAGCAGA ATG AGC GGC AGC GTT GGG GAG ATG 412 

Met Ser Gly Ser Val- Gly Glu Met 
1 5 

GCC cAg ACC TCT TCT • TCC TCC TCC TCC'ACC TTC GAG CAC CTG TGG AGT 460 
Ala GlnXlhr Ser Ser Ser Ser Ser Ser Thr Phe Glu His Leu Trp Ser ' 
10 \ 15 20 



TCT CTA GA5. CCA GAC AGC ACC TAG TTT GAC CTC CCC CAG CCC AGC CAA 
Ser Leu Glu>Pro Asp Ser Thr Tyr Phe Asp Leu Pro Gin Pro Ser Gin 
25 \ 30 35 40 

GGG ACT AGC GAG\gcA TCA GGC AGC GAG GAG TCC AAC ATG GAT GTC TTC 
Gly Thr Ser Glu Ala Ser Gly Ser Glu Glu Ser Asn Met Asp Val Phe 

' SO . 55 , 

CAC CTG CAA GGC ATG >GCC CAG TTC AAT TTG CTC AGC AGT GCC ATG GAC 
His Leu Gin Gly Met Ala Gin Phe Asn Leu Leu' Ser Ser Ala Met Asp 
60 \ 65 70 

CAG ATG GGC AGC CGT GCG %CC CCG GCG AGC CCC TAC ACC CCG GAG CAC 
Gin Met Gly Ser Arg Ala AAa Pro Ala Ser Pro Tyr Thr Pro Glu His 
75 \ 80 85 

GCC GCC AGC GCG CCC ACC CAC \cG CCC TAC GCG CAG CCC AGC TCC ACC 
Ala Ala Ser Ala Pro Thr His Sdx Pro Tyr Ala Gin Pro Ser Ser Thr 
50 95 \ 100 

TTC GAC ACC ATG TCT CCG GCG CCT ^C ATC CCT TCC AAT ACC GAC TAC 
Phe Asp Thr Met Ser Pro Ala Pro Va^l He Pro Ser- Asn Thr Asp Tvr 

105 110 -\ 115 120 

CCC GGC CCC C 
Pro Gly Pro 



508 



556 



604 



652 



700 



74 8 



75 8 




(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Gly Ser Val Gly Glu Met Ala Gin Thr Ser Ser SerXSer Ser 
1 5 10 

Ser Thr Phe Glu His Leu Trp Ser Ser Leu Glu Pro Asp Ser Th\Tvr 
20 25 30 

Phe Asp Leu Pro Gin Pro Ser Gin Gly Thr Ser Glu Ala Ser Gly Sei 
35 40 45 

Glu Glu Ser Asn Met Asp Val Phe His Leu Gin Gly Met Ala Gin Phe 
50 55 

Asn Leu Leu Ser Ser Ala Met Asp Gin Met Gly Ser Arg Ala Ala Pro 

^5 70 75 90 

Ala Ser Pro Tyr Thr Pro Glu His Ala Ala Ser Ala Pro Thr His Ser 
85 90 95 

Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Thr Met Ser Pro Ala Pro 
100 105 110 
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Vai lie Pro Ser Asn Thr Asp Tyr Pro Gly Pro 
lis 120 

f 2 INFORMATION FOR SEQ ID NO: 11: 

[1} SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 559 base pairs 
(3) TYPE: nucleic acid 
iC) STPANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOtECULE TYPE: cDNA 

(vi) ORIG^AL SOURCE: 

(A) ^ORGANISM: Homo sapiens 



ATTCCAGCAT 
TCCTGCCTCA 
CGTGACCCTT 
TGGCGTGTGC 
TCGCCACGGC 
CCTCGGCCAG 
CACAACCCAG 
ACTACCCCGG 
CCACCTGGAC 




(xi) SEQUENCES^DESCRIPTION: SEQ ID NO; 11: 
CGACCTTCCC CAGTCAAGKI^C GGGGGAATAA TGAGGTGGTG ' GGCGGAACGG 
GGACGTCTTC CACCTGGAG<^GCATGACTAC- ATCTGTCATG CATCCTCGGC 
CTAGCTGCGG AGCCTCTCCC <^TCGGTCCA CGCTGCCGGG CGGCCACGAC 
CCCCTCGGGC CGCCCAGATC CAT^^CCTCGt CCCACGGGAC ACCAGTTCCC 
AGACCCCCCG GCGCCTACCA TGCT(5rACGT CGGTGACCCC GCACGGCACC 
CCAGTTCAAT CTGCTGAGCA GCACCAf^A CCAGATGAGC AGCCGCGCGG 
CCCCTACACC CCAGAGCACG CCGCCAGCG^ GCCCACCCAC TCGCCCTACG 
CTCCACCTTC GACACCATGT CGCCGGCGCC \gTCATCCCC TCCAACACCG 
ACCCCACCAC TTTGAGGTCA CTTTCCAGCA GT^CAGCACG GCCAAGTCAG 
GTACTCCCCG CTCTTGAAG 

) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 
ATGCTGTACG TCGGTGACCC CGCACGGCAC CTCGCCACGG CCCAGTTCAA TCTG^TGAGC 
AGCACCATGG ACCAGATGAG CAGCCGCGCG GCCTCGGCCA GCCCCTACAC CCCAGAd^AC 
GCCGCCAGCG TGCCCACCCA CTCGCCCTAC GCACAACCCA GCTCCACCTT CGACACCAI 
TCGCCGGCGC CTGTCATCCC CTCCAACACC GACTACCCCG GACCCCACCA CTTTGAGGTC 
ACTTTCCAGC AGTCCAGCAC GGCCAAGTCA GCCACCTGGA CGTACTCCCC GCTCTTGAAG 
AAACTCTACT GCCAGATCGC CAAGACATGC CCCATCCAGA TCAAGGTGTC CACCCCGCCA 
CCCCCAGGCA CTGCCATCCG GGCCATGCCT GTTTACAAGA AAGCGGAGCA CGTGACCGAC 
GTCGTGAAAC GCTGCCCCAA CCACGAGCTC GGGAGGGACT TCAACGAAGG ACAGTCTGCT 



60 
120 
180 
240 
300 
3 60 
420 
480 
540 
559 



60 
120 
180 
240 
300 
J60 
420^ 
480 
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CCAGCCAGCC ACCTCATCCG CGTGGAAGGC AATAATCTCT 
GTCACCGGCA GGCAGAGCGT CGTGGTGCCC TATGAGCCAC 
ACCACCATCC TGTACAACTT CATGTGTAAC AGCAGCTGTG 
CCZ?k7CcHa TCATCATCAC CCTGGAGATG CGGGATGGGC 
TTTGAGGGCC Nk:ATCTGCGC CTGTCCTGGC CGC'GACCGAA 
CGGGAGCAGC AGiKCCTGAA CGAGAGCTCC GCCAAGAACG 
ttcaagcaga gccc\cctgc CGTCCCCGCC CTTGGTGCCG 
GGAGACGAGG ACACGTAfTA CCTTCAGGTG CGAGGCCGGG 
AAGCTGAAAG AGAGCCTG^^ GCTGATGGAG TTGGTGCCGC' 
CGGGAGCAGC AGCAGCTCCT\iCAGAGGCCG AGTCACCTAC 
GTCCTCTCGC CCATGAACAA Gd^GCACGGG GGCATGAACA 

ctggtgggcc agcctccccc gcaAxgttcg GCAGCTACAC 

CCCGGGATGC TCAACAACCA TGGCCA^S^GCA GTGCCAGCCA 
CACAGCGCCC AGTCCATGGT CTCGGGGT^ CACTGCACTC 
GACCCCAGCC TCGTCAGTTT TTTAACAGGaVt GGGGT GTC 
ACCTCCCAAG GGTTACAGAG CATTTACCAC Ct)5CAGAACC 
GCCCTGAAGA TCCCCGAGCA GTACCGCATG ACCA^TGGC 
CAGGGCCACG ACTACAGCAC CGCGCAGCAG CTGCTC^OCT 
TCCATCGGCG GCTCAGGGGA ACTGCAGCGC CAGCGGGT' 
GTGCGCCACA CCATCACCAT CCCCAACCGC GGCGGCCCAG 
GCGGACTTCG GCTTCGACCT GCCCGACTGC AAGGCCCGCA 
TTCACGGAGG CCGAGATCCA CTGA 
(2) INFORMATION FOR SEQ ID NO: 13: ' 

(i) SEQUENCE CHARACTERISTICS: ■ 

(A) LENGTH: 587 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: protein 



CGCAGTATGT 
CACAGGTGGG 
TAGGGGGCAT 
AGGTGCTGGG 
AAGCTGATGA 
GGGCCGCCAG 
GTGTGAAGAA 
AGAACTTTGA 
AGCCACTGGT 
AGCCCCCGTC 
AGCTGCCCTC 
CCAACCTGGG 
ACGGCGAGAT 
CGCCACCCCC 
CAAACTGCAT 
TGACCATTGA 
GGGGCCTGCA 
CTAGCAACGC 
TGGAGGCCGT 
GGCGGCCC 
AG^GCCCAT 



GGATGACCCT 

GACGGAATTC 

GAACCGGCGG 

CCGCCGGTCC 

GGACCACTAC 

CAAGCGTGCC 

GCGGCGGCAT 

GATCCTGATG 

GGACTCCTAT 

CTACGGGCCG 

CGTCAACCAG 

GCCCGTGGGC - 

GAGCAGCAGC 

CTACCACGCC 

CGAGTATTTC 

GGACCTGGGG 

GGACCTGAAG 

GGCCACCATC 

GCACTTCCGC 

TGACGAGTGG 

CAAGGAGGAG 



540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
12 60 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1764 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: . 

Met Leu Tyr Val Gly Aap Pro Ala Arg Hia Leu Ala Thr Ala Gin gyne 
1 5 10 15 

Asn Leu Leu Ser Ser Thr Met Asp Gin Met Ser Ser Arg Ala Ala Sei 

20 25 30 

Ala Ser Pro Tyr Thr Pro Glu Hia Ala Ala Ser Val Pro Thr Hia Ser 
35 40 45 

Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Thr Met Ser- Pro Ala Pro 
50 55 60 

Val lie Pro Ser Aan Thr Aap Tyr Pro Gly Pro Hia Hia Phe Glu Val 
^5 70 75 80 
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Thr Phe Gin Gin Ser Ser Thr Ala Lya Ser Ala Thr Trp Thr Tyr Ser 
85 90 95 

Pro Leu Leu Lys Lys Leu Tyr Cys Gin lie Ala Lys Thr Cys Pro lie 
100 105 110 

il\lle Lya Val Ser Thr Pro Pro Pro Pro Gly Thr Ala lie Arg Ala 

'^115 - 120 125 

Met ?^ Val Tyr Lys Lys Ala Giu His Val Thr. Asp Val Val Lys Arg 
130^ 135 140 

Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gin Ser Ala 
145 \ 150 155 160 

Pro Ala SerXHis Leu lie Arg Val Glu Gly Asn Asn Leu Ser Gin Tyr 
165 170 175 

Val Asp Asp Pr^Val Thr Gly Arg Gin Ser Val Val Val Pro Tyr Glu 
180\ 185 190 

Pro Pro Gin Val gIsv Thr Glu Phe Thr Thr lie Leu Tyr Asn Phe Met 
195 \ 200 205 

Cys Asn Ser Ser Cys ^Xal Gly -Gly Met Asn Arg Arg Pro lie Leu lie 
210 \ 215 220 

lie lie Thr Leu Glu Met\Arg Asp Gly Gin Val Leu Gly Arg Arg Ser 
225 230 \ 235 240 

Phe Giu Gly Arg He Cys Alk Cys Pro Gly Arg Asp Arg Lys Ala Asp 
245 \ 250 255 

Glu Asp His Tyr Arg Glu Gin g\ii Ala Leu Asn Glu Ser Ser Ala Lys 
260 \,265 270 

Asn Gly Ala Ala Ser Lys Arg Ala ^»he Lys Gin Ser Pro Pro Ala Val 
275 280 \ 285 

Pro Ala Leu Gly Ala Gly Val Lys LysNfirg Arg His Gly Asp Glu Asp 
290 295 \ 300 

Thr Tyr Tyr Leu Gin Val Arg Gly Arg Gl^ Asn Phe Glu He Leu Met 
305 310 N315 320 

Lys Leu Lys Glu Ser Leu Glu Leu Met Glu Le^ Val Pro Gin Pro Leu 
325 330 \ 335 

Val Asp Ser Tyr Arg Gin Gin Gin Gin Leu Leu g\i Arg Pro Ser His 
340 345 \ 350 

Leu Gin Pro Pro Ser Tyr Gly Pro Val Leu Ser Pro Asn Lys Val 

355 360 ^ 



His Gly Gly Met Aan Lys Leu Pro Ser Val Asn Gin LeuVal Gly Gin 
370 375 380 

Pro Pro Pro His Ser Ser Ala Ala Thr Pro Asn Leu Gly Pr\val Gly 
385 390 395 \ 400 

Pro Gly Met Leu Asn Asn His Gly His Ala Val Pro Ala Asn gV Glu 
405 410 41? 

Met Ser Ser Ser His Ser Ala Gin Ser Met Val Ser Gly Ser His c\ys 
420 425 430 

Thr Pro Pro Pro Pro Tyr His Ala Asp Pro Ser Leu Val Ser Phe LeuN 
435 440 445 

Thr Gly Leu Gly Cys Pro Asn Cys II Glu Tyr Phe Thr Ser Gin Gly 
450 455 460 
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Leu Gin Ser lie Tyr Hi3 Leu Gin Asn Leu Thr He Glu Asp Leu Gly 

470 • 475 . 480 

Ala Leu Lys He Pro Glu Gin Tyr Arg Met Thr He Trp Arg Gly Leu 
485 490 

Asp Leu Lys Gin Gly His Asp Tyr Ser Thr Ala Gin Gin Leu Leu 
500 505 510 

Arg 3^ Ser Asn Ala Ala Thr lie Ser He Gly Gly Ser Gly Glu Leu 
\515 520 525 

Gin Arg G^ Arg Val Met Glu Ala-Val His Phe Arg Vai Arg His Thr 
530 \ 535 545 

He Thr He p\o Asn Arg Gly Gly Pro Gly Gly Gly Pro Asp Glu Tro 
545 \ 550 555 5 60 

Ala Asp Phe Gly ^ Asp Leu Pro Asp Cys Lys Ala Arg Lys Gin Pro 
56X 570 575 

He Lys Glu Glu Phe ^Shr Glu Ala Glu He His 
580 \ 585 

(2) INFORMATION FOR SEQ ID Nc\ 14: 

(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 1521 base ^^irs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double^ 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo, sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
ATGCTGTACG TCGGTGACCC CGCACGGCAC CTCGCCACGG CC^GTTCAA TCTGCTGAGC 
AGCACCATGG ACCAGATGAG CAGCCGCGCG GCCTCGGCCA GCCCeTACAC CCCAGAGCAC 
GCCGCCAGCG TGCCCACCCA CTCGCCCTAC GCACAACCCA GCTCCA^T CGACACCATG 
TCGCCGGCGC CTGTCATCCC CTCCAACACC GACTACCCCG GACCCCACck CTTTGAGGTC 
ACTTTCCAGC AGTCCAGCAC GGCCAAGTCA GCCACCTGGA CGTACTCCCC OjTCTTGAAG 
AAACTCTACT GCCAGATCGC CAAGACATGC CCCATCCAGA TCAAGGTGTC CAC\CCGCCA 
CCCCCAGGCA CTGCCATCCG GGCCATGCCT GTTTACAAGA AAGCGGAGCA CGTGA\cGAC 
GTCGTGAAAC GCTGCCCCAA CCACGAGCTC GGGAGGGACT TCAACGAAGG ACAGTCT? 
CCAGCCAGCC ACCTCATCCG CGTGGAAGGC AATAATCTCT CGCAGTATGT GGATGACCC] 
GTCACCGGCA GGCAGAGCGT CGTGGTGCCC TATGAGCCAC CACAGGTGGG GACGGAATTC 
ACCACCATCC TGTACAACTT CATGTGTAAC AGCAGCTGTG TAGGGGGCAT GAACCGGCGG 
CCCATCCTCA TCATCATCAC CCTGGAGATG CGGGATGGGC AGGTGCTGGG CCGCCGGTCC 

tttgagggcc gcatctgcgc ctgtcctggc cgc.gaccgaa aagctgatga ggaccactac 

CGGGAGCAGC AGGCCCTGAA CGAGAGCTCC GCCAAGAACG GGGCCGCCAG CAAGCGTGCC 
TTCAAGCAGA GCCCCCCTGC CGTCCCCGCC CTTGGTGCCG GTGTGAAGAA GCGGCGGCAT 
GGAGACGAGG ACACGTACTA CCTTCAGGTG CGAGGCCGGG AGAACTTTGA GATCCTGATG 



60 
120 
ISO' 
240 
300 
360 
420 
480 
540 
600 
560 

i: 

780 
340 
900 
960 
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AAGCTGAAAG AGAGCCTGGA GCTGATGGAG TTGGTGCCGC 
CGGCAGCAGC AGCAGCTCCT ACAGAGGCCG CCCCGGGATG 
AGT&CCAGCC AACGGCGAGA TGAGCAGCAG CCACAGCGCC 
CCACTOCACT CCGCCACCCC CCTACCACGC CGACCCCAGC 
CTGAAGATSZC CCGAGCAGTA CCGCATGACC ATCTGGCGGG 
ggccacgacKacagcaccgc GCAGCAGCTG CTCCGCTCTA 
ATCGGCGGCT C^^GGGGAACT GCAGCGCCAG CGGGTCATGG 
CGCCACACCA TcAcATCCC CAACCGCGGC GGCCCAGGCG 
GACTTCGGCT TCGACOTGCC CGACTGCAAG GCCCGCAAGC 
ACGGAGGCCG AGATCCAQ^G A 
(2) INFORMATION FOR ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5(76 amino acids 

(B) TYPE.: aminos. acid 
(D) TOPOLOGY: liVear 

(ii) MOLECULE TYPE: protVin 



AGCCACTGGT 
CTCAACAACC 
CAGTCCATGG 
CTCGTCAGGA 
GCCTGCAGGA 
GCAACGCGGC 
AGGCCGTGCA 
GCGGCCCTGA 
AGCCCATCAA 



GGACTCCTAT 
ATGGCCACGC 
TCTCGGGGTC 
CCTGGGGGCC 
CCTGAAGCAG 
CACCATCTCC 
CTTCCGCGTG 
CGAGTGGGCG 
GGAGGAGTTC 



1020 

108 0 

1140 

1200 

1260 

1320 

1330 

1440 

1500 

1521 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15; 



Met Leu Tyr Val Gly Asp Pro AlW Arg His Leu Ala Thr Ala 

1 5 ■ \ 10 - 



Asn Leu Leu Ser Ser Thr Met Asp GPn Met 
20 25 

Ala Ser Pro Tyr Thr Pro Glu His Ala 
35 40 

Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp 

50 55 

Val lie Pro Ser Asn Thr Asp Tyr Pro Gly 

65 70 

Thr Phe Gin Gin Ser Ser Thr Ala Lys Ser 
85 90 

Pro Leu Leu Lys Lys Leu Tyr Cys Gin He 

100 105 

Gin He Lys Val Ser Thr Pro Pro Pro Pro 
115 120 

Met Pro Val Tyr Lys Lys Ala Glu His Val 

130 135 

Cys Pro Asn His Glu Leu Gly Arg Asp Phe 

145 ISO 



Ser Ser Arg Ala 

30 

Ser Val " Pro Thr 
45 

jr Met Ser Pro 
60 

Pro His His Phe 
75 

Ala ThrNyrp Thr 



Gin Phe 
15 

Ala Ser 



Ala Lys Th\cys 

10 

Gly Thr Ala i: 
125 

Thr Asp Val Val 
140 

Asn Glu Gly Gin 
155 



Pro Ala Ser His Leu He Arg Val Glu Gly Asn Asn Leu Ser 
165 170 



Val Asp Asp Pro Val Thr Gly Arg Gin S r 
190 185 

Pro Pro Gin Val Gly Thr Glu Phe Thr Thr 
195 200 



Val Val Val Pro 
190 

He Leu Tyr Asn 
205 



His Ser 

Ala Pro 

Glu Val 
80 

Tyr Ser 
95 

Pro He 

Arg Ala 

LVs Arg 

Ser 

Gin Tyi 
175 

Tyr Glu 
Phe Met 
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:y3 Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro He Leu lie 
210 215 220 

lie lie Thr Leu Glu Met Arg Asp Gly Gin- Val Leu Gly Arg Arg Ser 
\25 230 235 ■ ^ ^ 240 

PhAGlu Gly Arg He Cys Ala Cys Pro" Gly Arg Asp Arg Lys Ala Asp 

^ 245 250 - 255 ■ ' 

Glu As^ His.Tyr Arg Glu Gin Gin Ala Leu Asn Glu Ser Ser Ala Lys 
260 265 270 

Asn Gly Alsa Ala Ser Lys Arg Ala Phe Lys Gin Ser Pro Pro Ala Val 
27\ 280 285 

Pro Ala Leu dly Ala' Gly Val Lys Lys Arg Arg His Gly Asp Glu Aso 
290 \ ' 295 300 

Thr Tyr Tyr Leu Nsin Val Arg Gly Arg Glu Asn Phe Glu lie Leu Met 
305 \ 310 315 320 

Lys Leu Lys Glu SeA^eu Glu Leu Met Glu Leu Val Pro' Gin Pro Leu 
325 \ -330 335 

Val Asp Ser Tyr Arg Gl\ Gin Gin Gin Leu Leu Gin Arg Pro Pro Arg 
340 \ 345 350 

Asp Ala Gin Gin Pro Trp p\o Arg Ser Ala Ser Gin Arg Arg Asp Glu 
355 \ 360 365 

Gin Gin Pro Gin Arg Pro Val his Gly Leu Gly Val Pro Leu His Ser 

370 . 37S \ 380 

Ala Thr Pro Leu Pro Arg Arg Pro^Gln Pro Arg Gin Asp Leu Gly Ala 
385 390 \ 395 400 



Leu Lys lie Pro Glu Gin Tyr Arg Met 
405 



He Trp .Ar.g Gly Leu Gin 
110 415 



Asp Leu Lys Gin Gly His Asp Tyr Ser Tl\ Ala Gin Gin Leu Leu Arg 
420 425 \ 430 

Ser Ser Asn Ala Ala Thr He Ser lie Gly <^y Ser Gly Glu Leu Gin 
435 440 \ 445 

Arg Gin Arg Val Met Glu Ala Val His Phe Arg \al Arg His Thr He 
450 455 41 

Thr He Pro Asn Arg Gly Gly Pro Gly Gly Gly Pro^Asp Glu Trp Ala 
465 470 475 \ 480 

Asp Phe Gly Phe Asp Leu Pro Asp Cys Lys Ala Arg Lyk Gin Pro He 
485 490 A . 495 

Lys Glu Glu Phe Thr Glu Ala Glu He His 
500 SOS 

INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 



75 



(B) LOCATION: 104., 1367 



.(xi) SEQUENCE DESCRIPTION: SEO ID NO: 16: 

:CCOGGGC TGCGACGGCT' GCAGGGAACC AGACAGCACC TACTTCGACC TTCCCCAGTC 

AAGCCGGOQG AATAATGAGG TGGTGGGCGG AACGGATTCC AGC ATG GAC GTC TTC 

Met Asp Vai Phe 
1 

CAC CTG GAG'^GGC ATG ACT ACA TCT GTC ATG GCC CAG TTC AAT CTG CTG 

His Leu Glu Oly Met Thr Thr Ser Val Met Ala Gin Phe Asn Leu Leu 

5. \ 10 15 20 

AGC AGC ACC ATG ^GAC CAG ATG AGC AGC CGC GCG GCC TCG GCC AGC CCC 
Ser Ser Thr Met Asp Gin Met Ser Ser Arg Ala Ala Ser Ala Ser Pro 

30 35 



TAC ACC CCA GAG CAC 
Tyr Thr Pro Glu His 
40 



:C GCC AGC GTG CCC ACC CAC TCG CCC TAC GCA 
La Ala Ser Val Pro Thr His Ser Pro Tyr Ala 

45 -50 



CAA CCC AGC TCC ACC TTC l^C ACC ATG TCG CCG GCG CCT GTC ATC CCC 
Gin Pro Ser Ser Thr Phe A^ Thr- Met Ser Pro Ala Pro Val lie Pro 

55 ■ . \ 60 ■ 65' 

TCC AAC ACC GAC TAC CCC GGA ?S^C CAC CAC TTT GAG GTC ACT TTC CAG 
Ser Asn Thr Asp T.yr Pro Gly Pfi^ His His Phe Glu Val Thr Phe Gin 
70 75 \ 80 

CAG TCC AGC ACG GCC AAG TCA GCC ACC TGG ACG TAC TCC CCG CTC TTG 
Gin Ser Ser Thr Ala Lys Ser Ala T«c Trp Thr Tyr Ser Pro Leu Leu 

35 90 \ 95 . ■ 100 

AAG AAA. CTC TAC TGC CAG ATC GCC AAG ACA TGC CCC ATC CAG ATC AAG 
Lys Lys Leu Tyr Cys Gin lie Ala Lys THt Cys Pro He Gin He Lys 
105 110\ 115 

GTG TCC ACC CCG CCA CCC CCA GGC ACT GCC J^C CGG GCC ATG CCT GTT 
Val Ser Thr Pro Pro Pro Pro Gly Thr Ala Il\ Arg Ala Met Pro Val 
120 125 130 

fAC AAG AAA GCG GAG CAC GTG ACC GAC GTC GTG Aa CGC TGC CCC AAC 
Tyr Lys Lys Ala Glu His Vai Thr Asp Val Val LyXArg Cys Pro Asn 
135 140 \145 

'CAC GAG CTC GGG AGG GAC TTC AAC GAA GGA CAG TCT GOff CCA GCC AGC 
His Glu Leu Gly Arg Asp Phe Asn Glu Gly Gin Ser Alk Pro Ala Ser 
150 155 160 

CAC CTC ATC CGC GTG GAA GGC AAT AAT CTC TCG CAG TAT GtG GAT GAC 
His Leu lie Arg Val Glu Gly Asn Asn Leu Ser Gin Tyr VaPvAsp Asp 
1^5 170 175 \ 180 

CCT GTC ACC GGC AGG CAG AGC GTC GTG GTG CCC TAT GAG CCA C^ CAG 
Pro Val Thr Gly Arg Gin Ser Val Val Val Pro Tyr Glu Pro Pr^Gln 
185 190 195 

GTG GGG ACG GAA TTC ACC ACC ATC CTG TAC AAC TTC ATG TGT AAC A^- 
Val Gly Thr Glu Phe Thr Thr He Leu Tyr Asn Phe Met Cys Asn Sei 

200 205 210 

AGC TGT GTA GGG GGC ATG AAC CGG CGG CCC ATC CTC ATC ATC ATC ACC 
Ser Cys Val Gly Gly Met Asn Arg Arg Pro II Leu He He He Thr 
215 220 225 

CTG GAG ATG CGG GAT GGG CAG GTG CTG GGC CGC CGG TCC TTT GAG GGC 
Leu Glu Met Arg Asp Gly Gin Val Leu Gly Arg Arg Ser Phe Glu Glv 
230 235 240 



60 
115 

163 

211 

259 

307 

355 

403 

451 

499 

547 

595 

643 

691 

739 



CGC ATC TGC GCC TGT CCT GGC CGC GAC CGA AAA GCT GAT GAG GAC CAC 



883\ 



76 



Arg lie 
245 

TAG CGG 
Tyr Arg 



Cys Ala 



GAG CAG 
Glu Gin 



GCC AGC 
Ala Ser 



GGT GCC 
Gly Ala 



GTT CAG 
Leu Gin 
310 

GAG AGC 
Giu Ser 

•325 

TAT CGG 
Tyr Arg 



.^^AG CGT 
Ly3\Arg 
0 

GGT GT( 
Gly Val^ 
295 

GTG CGA 
Val Arg 



Cys Pro 
250 

CAG GCC 
.Gin Ala 
265 

GCC TTC 
Ala ?he 



Gly Arg Asp Arg 



CTG .AAC 
Leu Asn 



AAG CAG 
Lys Gin 



AAG AAG 
^Ly3 Lys 

CGG 
Gl y\Arg 



CTG GAG 
Leu Glu 



CAG CAG 
Gin Gin 



CCG TCC 
Pro Ser 



ATG AAC 
Met Asn 



CAC AGT 
His Ser 
390 



TAC GGG 
Tyr Gly 
360 

AAG CTG 
Lys Leu 
375 

TCG GCA 
Ser Aia 



CTG ATi 
Leu Me 
330 

CAG CAG 
Gin Gin 
345 

CCG GTC 
Pro Val 



CCC TCC 
Pro Ser 



GCT ACA 
Ala Thr 



CGG CGG 
Arg Arg 
300 

GAG AAC 
Glu Asn 
315 

GAG TTG 
Lu Leu 



GAG AGC 
Glu Ser 
270 

AGC CCC 
Ser Pro 
235 

CAT GGA 
His Gly 



Lys Ala Asp 
255 

TCC GCC AAG 
Ser Ala Lys 



Glu 



AAC 
Asn 



CCT GCC GTC 
Pro Ala Val 



GAC 
Asp 



TTT GAG 
Phe Glu 



GTG CCG 
Val Pro 



CTO. CTA 
LeuV^eu 



CTC TCC ^ 
Leu Ser' 



GTC AAC 
Val Asn 
380 

CCC AAC 
Pro Asn 
395 



CAG AGG 
Gin Arg 
350 

ATG 
pro Met 
55 

CAGVvCTG 
Gin 



ATC 
lie 



CAG 
Gin 
335 

CCG 
Pro 



GAG GAC 
Glu Asp 
305 

CTG ATG 
Leu Met 
320 

CCA CTG 
Pro Leu 



CCC 
Pro 
290 

ACG 
Thr 



Asp His 
260 

GGG GCC 
Gly Ala 
275 

GCC CTT 
Aia Leu 



TAC TAC 
Tyr Tyr 



AAG CTG AAA 
Lys Leu Lys 



AGT CAC 
Ser His 



CTG GGG 
Leu Gly 



AAC AAG GTG 
Asn Lys Val 



GTG GGC CAG 
Val Gly Gin 
385 

GTG GGC 
Val Gly 
400 



GTG GAC TCC 
Val Asp Ser 
340 

CTA CAG CCC 
Leu Gin Pro 
355 

CAC GGG GGC 
His Gly Gly 
370 

CCT CCC .CCG 
Pro Pro Pro 



CCC GGG ATG 
Pro Gly Met 



CTC AAC 
Leu Asn 
405 

AGC CAC 
Ser His 



AAC CAT 
Asn His 



AGC GCC 
Ser Ala 



CCC CCC 
Pro Pro 



GGG TGT 
Gly Cys 



ATT TAC 
lie Tyr 
470 

ATC CCC 
He Pro 
485 

AAG CAG 
Lys Gin 



TAC CAC 
Tyr His 
440 

CCA AAC 
Pro Asn 
455 

CAC CTG 
His Leu 



GAG CAG 
Glu Gin 



GGC CAC 
Gly His 
410 

CAG TCC 
Gin Ser 
425 

GCC GAC 
Ala Asp 



GCA GTG 
Ala Val 



ATG GTC 
Met Val 



CCC AGC 
Pro Ser 



TGC ATC 
Cys He 



CAG AAC 
Gin Asn 



TAC 
Tyr 



GGC CAC 
Gly His 



AAC GCG 
Asn Ala 



GCC ACC 
Ala Thr 
520 



GAC 
Asp 

505 

ATC 
He 



CGC 
Arg 
490 

TAC 
Tyr 



GAG TAT 
Glu Tyr 
460 

CTG ACC 
Leu Thr 
475 

ATG ACC 
Met Thr 



CCA GCC 
Pro Ala 



TCG GGG 
Ser Gly 
430 

CTC GTC 
Leu Val 
445 

TTC ACC 
Phe Thr 



AAC 
Asn 
415 

TCC CAC 
Ser His 



GAG 
Glu 



ATG AGC 
Met Ser 



ACT CCG 
Thr Pro 
435 



AGT TTT TTA 
Ser Phe Leu 



GGA 

.Gly 



AGC 
Ser 

420 

CCA 
Pro 



TTG 
Leu ^ 



TCC CAA 
Ser Gin 



ATT GAG 
He Glu 



ATC TGG 
He Trp 



TCC 
Ser 



AGC ACC 
Ser Thr 



ATC GGC- 
He Gly 



GCG CAG 
Ala Gin 
510 

GGC TCA 
Gly Ser 
525 



GAC CTG 
Asp Leu 
480 

CGG GGC 
Arg Gly 
495 

CAG CTG 
Gin Leu 



GGG 
Gly 
465 

GGG 
Gly 



CTG 
Leu 



CTC 
Leu 



TTA CA^ 
Leu 



AGC 
L Ser 



GCC CTG 
Ala Leu Lyi 



GGG GAA CTG 
Gly Glu Leu 



CAG GAC 
Gin Asp 



CGC TCT 
Arg Ser 
515 

CAG CGC 
Gin Arg 
530 



CTG 
Leu 

500 

AGC 
Ser 



CAG 
Gin 



931 



979 



1027 



1075 



1123 



1171 



1219 



1267 



1315 



1363 



1411 



1459 



1507 



1555 



CGG GTC ATG GAG GCC GTG CAC TTC CGC GTG CGC CAC ACC ATC ACC ATC 



169S 



1747 



77 



Arg val Met Glu Ala Val His Phe Arg Val Arg His Thr He Th- He 
535 540 545 

CCC\AAC CGC GGC GGC CCA GGC GGC GGC CCT GAC GAG TGG GCG GAC TTC 
•^ro A^"? Pro Giy Oly Giy Pro Asp Glu Trp Ala Asd Phe 

555 5 60 ' ' 



GGC TTC\GAC CTG CCC GAC TGC AAG GCC CGC AAG CAG CCC ATC AAG GAG 
Gly Phe 7^3 p Leu Pro Asp Cys Lys Ala Arg Lys Gin Pro He Lys Glu 

570 575 580 



56 = 



GAG TTC ACgNgAG GCC GAG ATC CAC TGA 
Glu Phe Thr Oj.u Ala Glu lie His 
585 



1795 



1343 



1370 



(2) INFORMATION POR SEQ ID NO: 17: 

(i) SEQUENck CHARACTERISTICS: 

(A) LENGTHy: 588. amino acids 

(B) TYPE: Wno acid 
(D) TOPOLOgX: linear 



(ii) MOLECULE TYPE:Vrotein 

(xi) SEQUENCE DESCRIPTSION : SEQ ID NO: 17: 

Met Asp Val Phe His Leu GluAGly Met Thr Thr Ser Val Met Ala Gin 

i 5 \ 10 . 15 

Phe Asn Leu Leu Ser Ser Thr Me\ Asp Gin Met Ser Ser Arg Ala Ala 
20 \ 25 30 

Ser Ala Ser Pro Tyr Thr Pro Glu }hs Ala Ala Ser Val Pro Thr His 
35 40 \ 45 

Ser Pro Tyr Ala Gin Pro Ser Ser ThrVhe Asp Thr Met Ser Pro Ala 
50 55 \ 60 

Pro Val He Pro Ser Asn Thr Asp Tyr Pr\ Gly Pro His His Phe Glu 

^5 70 \ 75 "80 

Val Thr Phe Gin Gin Ser Ser Thr Ala Lys sSar Ala Thr Trp Thr Tyr 
85 90 \ 95 

Ser Pro Leu Leu Lys Lys Leu Tyr Cys Gin IleVla Lys Thr Cys Pro 
100 105 \ 110 

He Gin He Lys Val Ser Thr Pro Pro Pro Pro Gl\ Thr Ala He Ara 
115 120 \i25 

Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr Val Val Lys 
130 135 ^ 

Arg Cys Pro Asn His Glu Leu Gly Arg Asp Phe Asn Glu\ly Gin Ser 

150 155 Y 160 

Ala Pro Ala Ser His Leu He Arg Val Glu Gly Asn Asn Le\ Ser Gin 
165 170 \i7S 

Tyr Val Asp Asp Pro Val Thr Gly Arg Gin Ser Val Val Val P^o Tvr 
180 185 190 \ 

Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr He Leu Tyr Asn\>he 
155 200 205 

Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg Pro He Lei 
^10 215 220 

He He He Thr Leu Glu Met Arg Asp Gly Gin Val Leu Gly Arg Arg 

230 235 240 
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Ser Phe Glu Gly Arg lie Cys Ala Cys Pro Gly 
245 250 

Asp CTiu Asp His Tyr Arg Glu Gin Gin Ala Leu 
^ 260 265 

Lys Asn C^y Ala Ala Ser Lys Arg Ala Phe Lys 
2\5 280 

Val Pro Ala\Leu Gly Ala Gly Val Lys Lys Arg 
290 \ 295 



Arg Asp Arg Lys 
255 

Asn Glu Ser Ser 

270 

Gin Ser Pro Pro 
285 

Arg His Gly Asp 
300 



Asp Thr Tyr Ty\ Leu Gin Val Arg Gly Arg Glu Asn Phe Glu lie 
305 \ 310 315 



Met Lys Leu Lys g\u Ser Leu Glu Leu Met Glu 
32\ 330 

Leu Val Asp Ser Tyr Vrg Gin Gin Gin Gin Leu 
340 \ 345 

His Leu Gin Pro Pro SerVTyr Gly Pro Val Leu 
355 \ 360 

Val His Gly Gly Met Asn Lyss Leu Pro Ser Val 

■ 370 -37? 

Gin Pro Pro Pro His Ser Ser Ala Ala Thr Pro 
385 390 \ 395 

Gly Pro Gly Met Leu Asn Asn HisAGly His Ala 
405 \ 410 



Leu Val Pro Gin 
335 

Leu Gin Arg Pro 

. 350 

Ser Pro Met Asn 
365 

Asn Gin Leu - Val 
380 

Asn Leu Gly Pro 



Ala 
Ala 
Ala 
Glu 



Leu 
320 

Pro 



Glu Met Ser Ser Ser His Ser Ala Gj>a Ser Met 
420 421 



Cys Thr Pro- Pro Pro Pro Tyr His Ala Xsp Pro 
435 440 

Leu Thr Gly. Leu Gly Cys Pro Asn Cys Il^Glu 
450 455 ^ 

sly Leu Gin Ser He Tyr His Leu Gin Asn 
^465 470 4' 

^ly Ala Leu Lys He Pro Glu Gin Tyr Arg Met 
485 490 

Leu Gin Asp Leu Lys Gin Gly His Asp Tyr Ser 
500 505 

Leu Arg Ser Ser Asn Ala Ala Thr He Ser He 
515 520 

Leu Gin Arg Gin Arg Val Met Glu Ala Val His 
530 535 

Thr He Thr He Pro Asn Arg Gly Gly Pro Gly 
545 550 555 

Trp Ala Asp Phe Gly Phe Asp Leu Pro Asp Cys 

565 570 

Prci He Lys Glu Glu Phe Thr Glu Ala Glu He 
580 585 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



Val Pro Ala Asn 

415 

Val Ser Gly Ser 
430 

Ser Leu Val Ser 
445 

Tyr Phe Thr Ser 
460 

Thr He Glu Asp 



He Trp Arg 
495 

Thr\Ala Gin Gin 
510 



Gly Gly Ser Gly 
5 



Phe Arg \^1 Arg 

540 ~ 

Gly Gly Pro \sp 



Lys Ala Arg Lys^ 
575 

His 



Ser 

Lys 

Gly 

Val 
400 

Gly 

His 

Phe 

Gin 

Leu 
480 

Gly 
Leu 
Glu 
His 



Glu 
560 



• 
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(D) TOPOLOGY: linear 

(ii) ^toLECULE TYPE: cDNA 

• (vi) ORliSINAL SOURCE: ■ 

(A)V)RGANISM: Homo sapiens 

(xi) SEQUENCk DESCRIPTION: SEQ ID NO: X8 : 
ATGGCCCAGT CCACCGc\aC CTCCCCTGAT GGGGGCACCA CGTTTGAGCA 
TCTCTGGAAC CAGACAGCAQ! CTACTTCGAC CTTCCCCAGT CAAGCCGGGG 

gtggtgggcg gaacggattcVagcatggac GTCTTCCACC TGGAGGGCAT 

GTCATGGCCC AGTTCAATCT G^TGAGCAGC ACCATGGACC AGATGAGCAG 
tcggccagcc cctacacccc ag;J^acgcc GCCAGCGTGC CCACCCACTC 
CAACCCAGCT CCACCTTCGA CACc\tGTCG CCGGCGCCTG TCATCCCCTC 

taccccggac cccaccactt tgaggtWct TTCCAGCAGT CCAGCACGGC 
acctggacgt actccccgct CTTGAAGM^A ctctactgcc agatcgccaa 
ATCCAGATCA AGGTGTCCAC CCCGCCACCO. ccaggcactg ccatccgggc 
tacaagaaag cggagcacgt gaccgacgtc mgaaacgct gccccaacca 
agggacttca acgaaggaca gtctgctcca gocagccacc tcatccgcgt 

AATCTCTCGC AGTATGTGGA TGACCCTGTC ACCdGCAGGC AGAGCGTCGT 

gagccaccac aggtggggac ggaattcacc accat\ctgt ACAACTTCAT 

AGCTGTGTAG GGGGCATGAA CCGGCGGCCC ATCCTCAtCA TCATCACCCT 
GATGGGCAGG TGCTGGGCCG CCGGTCCTTT GAGGGCCGC^A TCTGCGCCTG 
GACCGAAAAG CTGATGAGGA CCACTACCGG GAGCAGCAGgVcCTGAACGA 

aagaacgggg ccgccagcaa gcgtgccttc aagcagagcc c\cctgccgt 

GGTGCCGGTG TGAAGAAGCG GCGGCATGGA GACGAGGACA CG-niCTACCT 
GGCCGGGAGA ACTTTGAGAT CCTGATGAAG CTGAAAGAGA GCCTGfeAGCT 

GTGCCGCAGC cactggtgga ctcctatcgg cagcagcagc agctcc^^ca 
cacctacagc ccccgtccta cgggccggtc CTCTCGCCCA TGAAi 
atgaacaagc tgccctccgt caaccagctg gtgggccagc ctcccccgca 

GCTACACCCA ACCTGGGGCC CGTGGGCCCC GGGATGCTCA ACAACCATGG 
CCAGCCAACG GCGAGATGAG CAGCAGCCAC AGCGCCCAGT CCATGGTCTC 
TGCACTCCGC CACCCCCCTA CCACGCCGAC CCCAGCCTCG TCAGGACCTG 
AGATCCCCGA GCAGTACCGC ATGACCATCT GGCGGGGCCT GCAGGACCTG 
ACGACTACAG CACCGGGCAG CAGCTGCTCC GCTCTAGCAA CGCGGCCACC 
GCGGCTCAGG GGAACTGCAG CGCCAGCGGG TCATGGAGGC CGTGCACTTC 
ACACCATCAC CATCCCCAAC CGCGGCGGCC CAGGCGGCGG CCCTGACGAG 
TCGGCTTCGA CCTGCCCGAC TGCAAGGCCC GCAAGCAGCC CATCAAGGAG 
AGGCCGAGAT CCACTGA 



cctctggagc 
gaataatgag 
gactacatct 
ccgcgcggcc 
gccctacgca 
caacaccgac 
caagtcaggc 
gacatgcccc 
catgcctgtt 
cgagctcggg 
ggaaggcaat 
ggtgccctat 
gtgtaacagc 
•ggagatgc gg 
tcctggccgc 
gagctccgcc 
ccccgccctt 
tcaggtgcga 
gatggagttg 
gaggccgagt 
gcacgggggc 
:agttcggca 
coacgcagtg 

toCCCAC 
GGGGc\cTGA 
AAGCAGGfeCC 
ATCTCCATC 
CGCGTGCGCC 
TGGGCGGACT 
GAGTTCACGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
30 
18X7 



• 
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) IMFORMATICN FOR SEQ ID NO: 19; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 99 amino acids 
(3) TYPE: amino acid 
(D) TOPOLOGY: linear 

.'11) >>OLECULE TYPE: protein 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Met Ala Glh Ser Thr Ala Thr Ser Pro Asp Gly Gly Thr Thr Phe Glu 
1 \ 5 . 10 IS 

His Leu Trp ^r Ser Leu Glu Pro Asp Ser Thr Tyr Phe Asp Leu Pro 

2\ - 25 .30 

Gin Ser Ser Arg\Gly Asn Asn Glu Val Val Gly Gly Thr Asp Ser Ser 

' 35 \ 40 .- 45 ■ 

Met Asp Val Phe HjSs Leu Glu Gly Met Thr Thr Ser Val Met Ala Gin 
50 \ 55 60 

Phe Asn Leu Leu Ser ^r Thr Met Asp Gin Met Ser Ser Arg Ala Ala 

65 7\) _ -75 -30 

Ser Ala Ser Pro Tyr ThA Pro Glu His Ala Ala Ser Val Pro Thr His 
35 \- 90 95 

Ser Pro Tyr Ala Gin Pro S^ Ser Thr Phe Asp Thr Met Ser Pro Ala 

100 \ 105 110 

Pro Val lie Pro Ser Asn Thr \sp Tyr Pro" Gly Pro His His Phe Glu 

i>o 125 

Val Thr Phe Gin Gin Ser Ser ThrVAla Lys Ser Ala Thr Trp Thr Tvr 
130 135 \ 140 

Ser Pro Leu Leii Lys Lys Leu Tyr c\ Gin lie Ala Lys Thr Cys Pro 
- 150 \ 155 160 

He Gin He Lys Val Ser Thr Pro Pro ^o Pro Gly Thr Ala^ He Arg 
165 ikl _ 175 

Ala Met Pro Val Tyr Lys Lys Ala Glu HisVal Thr Asp Val Val Lys 
180 185 \ 190 

Arg Cys Pro Asn His Glu Leu Gly Arg Asp Ph\ Asn Glu Gly Gin Ser 
195 200 \ 205 

Ala Pro Ala Ser His Leu He Arg Val Glu Gly ^n Asn Leu Ser Gin 
210 215 z: 

Tyr Val Asp Asp Pro Val Thr Gly Arg Gin Ser Val\al Val Pro Tyr 



230 



235 



240 



Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr He Le\ Tyr Asn Phe 

245 250 y 255 

Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg Arg )^o He Leu 
260 265 27^ 

He He He Thr Leu Glu Met Arg Asp Gly Gin Val Leu GlyVra Ara 
275 280 285 ^ 

Ser Phe Glu Gly Arg He Cy^ Ala Cys Pro Gly Arg Asp Arg Ly\ Ala 



295 



300 



Asp Glu Asp His Tyr Arg Glu Gin Gin Ala Leu Asn Glu Ser Ser 

^05 310. 315 
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L ys) T^an 

Val Pro 

Asp Thr 

Met Lys 
370 

Leu Val 
335 

His Leu 
Val His 
Gin Pro 

Gly Pro 

■ 450 

Glu Met 
465 

Cys Thr 
Trp Gly 



Gly Ala Ala 
325 

^Ala Leu Gly 
340 

Ty\ Tyr Leu 

355 

Leu lVs Glu 

Asp Ser \yr 

Gin Pro Pre 
405 

Gly Gly Met 

420 

Pro Pro His 
435 

Gly Met Leu 
Ser Ser Ser 



Ser Lys 

Ala Gly 

Gin Val 

Ser Leu 
375 

Arg Gin 
390 

Ser Tyr 
Ai« Lys 



Arg Ala Phe Lys 
330 

Val Lys Lys Arg 
345 

Arg Gly Arg Glu 
360 

Glu Leu Met Glu 



Pro Pro Pro 
485 

Pro 



Asn Asn 
455 

His Ser 
470 

Pro Tyr 



Gin Gin Gin Leu 
395 

Gly Pro Val Leu 
410 

Leu Pro Ser Val 
425 

Ala Ala Thr Pro 
440 

Ls Gly His Ala 



Ala\Gln Ser Met 
475 

His Al\ Asp Pro 
\4 90 



Gin Ser Pro 



Arg His Gly 
350 

Asn Phe Glu 
365 

Leu Val Pro 
380 

Leu Gin Arg 



Ser Pro Met 

Asn Gin Leu 

430 

Asn Leu Gly 
445 

Val Pro Ala 
4 60 

Val Ser Gly 
Ser Leu Val 



Pro Ala 
335 

Asp Glu 

lie Leu 

Gin Pro 

Pro Ser 
400 

Asn Lys 
415 

Val Gly 
Pro Val 
Asn Gly 



Ser His 
480 

Arg Thr 
495 




(2) INFOF^MATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 
(ili) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GCGAGCTGCC CTCGGAG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



• 
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GGTTCTGd)\G GTGACTCAG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQ^NCE CriARACTERISTICS: 
(A)\lengtH: 18 base pairs" 
(3) nucleic acid 

(C) s\rANDEDNESS: single 

(D) T0V:>L0GY: linear 

(ii) MOLECULE \yPE: DNA 

(iii) anti-sense\no 
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(xi) SEQUENCE DESCRXpTION: SEQ ID NO: 22: 
GCCATGCCTG TCTACAAG 

(2) INFORMATION FOR SEQ ID \0: 23: 

{!) SEQUENCE CHARACTERI^ICS: 

(A) LENGTH: 18 baseXpairs 

(B) TYPE: nucleic aca 

(C) STRANDEDNESS: 3in(3ae 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N(\ 23; 
ACCAGCTGGT TGACGGAG 

(2)^ INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTCAACCAGC TGGTGGGCCA G 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 




18 



18 



21 



(xi) SEQUENCE DESCRIPTION: SEQ. ID NO: 25: 



V 
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GTGGATCTCG GCCTCC 

(2) IV|-ORMATION FOR SEQ ID UO: 26: 

(i\s£QUENCE CHARACTERISTICS: 

^{A) LENGTH: 17 base pairs' 
^) 'TYPE: nucleic acid 
{\) 3TPANDEDNESS : single 
(d\ TOPOLOGY: linear 

(ii) MOLEOULS TYPE: DNA 

(iii) ANTI-s\nSE: NO 
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(xi) SEQUENCE D^fSCRIPTION: SEQ ID NO: 26: 
AGGCCGGCGT' GGGGAAG 

(2) INFORMATION FOR SE>C ID NO: 27: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19\a3e pairs 
(3) TYPE: nucleft acid 

(C) STRANDEDNESS.\ single 

(D) TOPOLOGY: lin^r 

(ix) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 




(xi) SEQUENCE DESCRIPTION: SEQ 
CTTGGCGATC TGGCAGTAG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

.(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: NO 



NO: 27: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GCGGCCACGA CCGTGAC 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: . nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 29: 
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GGCAGCTTGG\GTCTCTGG 

(2) INFORMAT^N FOR SEQ ID NO: 30: 

(i) seque>k:e CHAPACTERISTICS: 
(A) LENGTH: 18 base pairs 
(3) TYSjE: nucleic acid 

(C) StRJ^DEDNESS: single 

(D) TOPoYoGY: linear 

(ii) MOLECULE TY^E: DMA 
(Lii) ANTI-SENSE: 



18 





(XI) SEQUENCE DESCRIPP^ON: SEQ ID NO: 30: 
CTGTACGTCG GTGACCCC . 
(2) INFORMATION FOR SEQ ID NO:\31: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 18 base paiXs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 
(ill) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
TCAGTGGATC TCGGCCTC 

(2/ INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHAPACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ' 
Ciii) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32; 
AGGGGACGCA GCGAAACC 

(2) INFORMATION FOR SEQ ID NO; 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 33: 
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CCATCAGCTC CAGGCTCTC 

(2) I>)S0RMATI0N FOR 5ZQ ID MO: 34: " 

{ ].)\SEQUENCE CHJ^lRACTERISTICS : 
(A) LENGTH:- 18 base pairs 
(3) TYPE: nucleic acid 
\C) STPANDEDNESS': single 
i) TOPOLOGY: linear 

(li) MOLECULE TYPE: DMA 

[iii) ANTI-3»£:NSE: YES 
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(xi) SEQUENCE Bi£SCRIPTION: SEQ ID NO: 34: 
CCAGGACAGG CGCAGATG 

(2) INFORMATION FOR SE^vID NO: 35; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 y&se pairs 

(B) TYPE: nuclei<\acid 

(C) STRANDEDNESS : \ingle 

(D) TOPOLOGY: lineal 

(ii) MOLECULE TYPE: DNA 
(lii) ANTI-SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID^ 
GATGAGGTGG CTGGCTGGA 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 



35: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TGGTCAGGTT CTGCAGGTG 
(2) INFORMATION FOR SEQ ID NO; 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(lii) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
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CACCTACTCiS AGGGATGC 

(2) INFORMOTION FOR SEQ ID NO: 38: 

(1) SEQUENCE CHARACTERISTICS: 
(a\ LENGTH: 21 base pairs 

(B) \TyPE: nucleic acid 

(C) VSTPANDEDNESS: single 

(D) VoPOLOGY: linear 

(ii) MOLECITLE TYPE: DNA ' 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE l\E5CRIPTI0N : SEQ ID NO:' 38; 
AGGAAAATAG AAGCGTCAC^T C 
(2) INFORMATION FOR s\q ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:. VB base pairs 

(B) TYPE: nucleic acid 
• (C) STRANDEDNESS : single 

(b) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNJ^ 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRI PTION :\sEQ ID NO: 39: 
CAGGCCCACT TGCCTGCC 

(2) INFORMATION FOR SEQ ID NO: 4^: 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 19 base pair! 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: MO: 
CTGTCCCCAA GCTGATGAG \ 



